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Preface 


The central role of quantum mechanics, as a unifying principle in contemporary 
physics, is reflected in the training of physicists who take a common course, whether 
they expect to specialize in atomic, molecular, nuclear, or particle physics, solid 
state physics, quantum optics, quantum electronics, or quantum chemistry. This book 
was written for such a course as a comprehensive introduction to the principles of 
quantum mechanics and to their application in the subfields of physics. 

The first edition of this book was published in 1961, the second in 1970. At 
that time there were few graduate-level texts available to choose from. Now there 
are many, but I was encouraged by colleagues and students to embark on a further 
revision of this book. While this new updated edition differs substantially from its 
predecessors, the underlying purpose has remained the same: To provide a carefully 
structured and coherent exposition of quantum mechanics; to illuminate the essential 
features of the theory without cutting corners, and yet without letting technical de- 
tails obscure the main storyline; and to exhibit wherever possible the common 
threads by which the theory links many different phenomena and subfields. 

The reader of this book is assumed to know the basic facts of atomic and sub- 
atomic physics and to have been exposed to elementary quantum mechanics at the 
undergraduate level. Knowledge of classical mechanics and some familiarity with 
electromagnetic theory are also presupposed. My intention was to present a self- 
contained narrative, limiting the selection of topics to those that could be treated 
equitably without relying on specialized background knowledge. 

The material in this book is appropriate for three semesters (or four quarters). 
The first 19 chapters can make up a standard two-semester (or three-quarter) course 
on nonrelativistic quantum mechanics. Sometimes classified as “Advanced Quantum 
Mechanics” Chapters 20-24 provide the basis for an understanding of many-body 
theories, quantum electrodynamics, and relativistic particle theory. The pace quick- 
ens here, and many mathematical steps are left to the exercises. It would be pre- 
sumptuous to claim that every section of this book is indispensable for learning the 
principles and methods of quantum mechanics. Suffice it to say that there is more 
here than can be comfortably accommodated in most courses, and that the choice of 
what to omit is best left to the instructor. 

Although my objectives are the same now as they were in the earlier editions, 
I have tried to take into account changes in physics and in the preparation of the 
students. Much of the first two-thirds of the book was rewritten and rearranged while 
I was teaching beginning graduate students and advanced undergraduates. Since 
most students now reach this course with considerable previous experience in quan- 
tum mechanics, the graduated three-stage design of the previous editions — wave 
mechanics, followed by spin one-half quantum mechanics, followed in turn by the 
full-fledged abstract vector space formulation of quantum mechanics — no longer 
seemed appropriate. In modifying it, I have attempted to maintain the inductive 
approach of the book, which builds the theory up from a small number of simple 
empirical facts and emphasizes explanations and physical connections over pure 
formalism. Some introductory material was compressed or altogether jettisoned to 
make room in the early chapters for material that properly belongs in the first half 
of this course without unduly inflating the book. I have also added several new topics 
and tried to refresh and improve the presentation throughout. 
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As before, the book begins with ordinary wave mechanics and wave packets 
moving like classical particles. The Schrodinger equation is established, the prob- 
ability interpretation induced, and the facility for manipulating operators acquired. 
The principles of quantum mechanics, previously presented in Chapter 8, are now 
already taken up in Chapter 4. Gauge symmetry, on which much of contemporary 
quantum field theory rests, is introduced at this stage in its most elementary form. 
This is followed by practice in the use of fundamental concepts (Chapters 5, 6, and 
7), including two-by-two matrices and the construction of a one-dimensional version 
of the scattering matrix from symmetry principles. Since the bra-ket notation is 
already familiar to all students, it is now used in these early chapters for matrix 
elements. The easy access to computing has made it possible to beef up Chapter 7 
on the WKB method. 

In order to enable the reader to solve nontrivial problems as soon as possible, 
the new Chapter 8 is devoted to several important techniques that previously became 
available only later in the course: Variational calculations, the Rayleigh-Ritz 
method, and elementary time-independent perturbation theory. A section on the use 
of nonorthogonal basis functions has been added, and the applications to molecular 
and condensed-matter systems have been revised and brought together in this 
chapter. 

The general principles of quantum mechanics are now the subject of Chapters 
9 and 10. Coherent and squeezed harmonic oscillator states are first encountered 
here in the context of the uncertainty relations. Angular momentum and the nonre- 
lativistic theory of spherical potentials follow in Chapters 11 and 12. Chapter 13 on 
scattering begins with a new introduction to the concept of cross sections, for col- 
liding and merging beam experiments as well as for stationary targets. 

Quantum dynamics, with its various “pictures” and representations, has been 
expanded into Chapters 14 and 15. New features include a short account of Feynman 
path integration and a longer discussion of density operators, entropy and infor- 
mation, and their relation to notions of measurements in quantum mechanics. All of 
this is then illustrated in Chapter 16 by the theory of two-state systems, especially 
spin one-half (previously Chapters 12 and 13). From there it’s a short step to a 
comprehensive treatment of rotations and other discrete symmetries in Chapter 17, 
ending on a brief new section on non-Abelian local gauge symmetry. Bound-state 
and time-dependent perturbation theories in Chapters 18 and 19 have been thor- 
oughly revised to clarify and simplify the discussion wherever possible. 

The structure of the last five chapters is unchanged, except for the merger of 
the entire relativistic electron theory in the single Chapter 24. In Chapter 20, as a 
bridge from elementary quantum mechanics to general collision theory, scattering 
is reconsidered as a transition between free particle states. Those who do not intend 
to cross this bridge may omit Chapter 20. The quantum mechanics of identical par- 
ticles, in its “second quantization” operator formulation, is a natural extension of 
quantum mechanics for distinguishable particles. Chapter 21 spells out the simple 
assumptions from which the existence of two kinds of statistics (Bose-Einstein and 
Fermi-Dirac) can be inferred. Since the techniques of many-body physics are now 
accessible in many specialized textbooks, Chapter 22, which treats some sample 
problems, has been trimmed to focus on a few essentials. 

Counter to the more usual quantization of the classical Maxwell equations, 
Chapter 23 starts with photons as fundamental entities that compose the electro- 
magnetic field with its local dynamical properties like energy and momentum. The 
interaction between matter and radiation fields is treated only in first approximation, 
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leaving all higher-order processes to more advanced textbooks on field theory. The 
introduction to the elements of quantum optics, including coherence, interference, 
and statistical properties of the field, has been expanded. As a paradigm for many 
other physical processes and experiments, two-slit interference is discussed repeat- 
edly (Chapters 1, 9, and 23) from different angles and in increasing depth. 

In Chapter 24, positrons and electrons are taken as the constituents of the rel- 
ativistic theory of leptons, and the Dirac equation is derived as the quantum field 
equation for charged spin one-half fermions moving in an external classical electro- 
magnetic field. The one-particle Dirac theory of the electron is then obtained as an 
approximation to the many-electron-positron field theory. 

Some important mathematical tools that were previously dispersed through the 
text (Fourier analysis, delta functions, and the elements of probability theory) have 
now been collected in the Appendix and supplemented by a section on the use of 
curvilinear coordinates in wave mechanics and another on units and physical con- 
stants. Readers of the second edition of the book should be cautioned about a few 
notational changes. The most trivial but also most pervasive of these is the replace- 
ment of the symbol fx for particle mass by m, or m e when it’s specific to an electron 
or when confusion with the magnetic quantum number lurks. 

There are now almost seven hundred exercises and problems, which form an 
integral part of the book. The exercises supplement the text and are woven into it, 
filling gaps and illustrating the arguments. The problems, which appear at the end 
of the chapters, are more independent applications of the text and may require more 
work. It is assumed that students and instructors of quantum mechanics will avail 
themselves of the rapidly growing (but futile to catalog) arsenal of computer soft- 
ware for solving problems and visualizing the propositions of quantum mechanics. 
Computer technology (especially MathType@ and Mathematica @ ) was immensely 
helpful in preparing this new edition. The quoted references are not intended to be 
exhaustive, but the footnotes indicate that many sources have contributed to this 
book and may serve as a guide to further reading. In addition, I draw explicit atten- 
tion to the wealth of interesting articles on topics in quantum mechanics that have 
appeared every month, for as long as I can remember, in the American Journal of 
Physics. 

The list of friends, students, and colleagues who have helped me generously 
with suggestions in writing this new edition is long. At the top I acknowledge the 
major contributions of John P. Hernandez, Paul S. Hubbard, Philip A. Macklin, John 
D. Morgan, and especially Eric Sheldon. Five seasoned anonymous reviewers gave 
me valuable advice in the final stages of the project. I am grateful to Mark D. 
Hannam, Beth A. Kehler, Mary A. Scroggs, and Paul Sigismondi for technical as- 
sistance. Over the years I received support and critical comments from Carl Adler, 
A. Ajay, Andrew Beckwith, Greg L. Bullock, Alan J. Duncan, S. T. Epstein, Heidi 
Fearn, Colleen Fitzpatrick, Paul H. Frampton, John D. Garrison, Kenneth Hartt, 
Thomas A. Kaplan, William C. Kerr, Carl Lettenstrom, Don H. Madison, Kirk 
McVoy, Matthew Merzbacher, Asher Peres, Krishna Myneni, Y. S. T. Rao, Charles 
Rasco, G. G. Shute, John A. White, Rolf G. Winter, William K. Wootters, and Paul 
F. Zweifel. I thank all of them, but the remaining shortcomings are my responsibility. 

Most of the work on this new edition of the book was done at the University 
of North Carolina at Chapel Hill. Some progress was made while I held a U.S. Senior 
Scientist Humboldt Award at the University of Frankfurt, during a leave of absence 
at the University of Stirling in Scotland, and on shorter visits to the Institute of 
Theoretical Physics at Santa Barbara, the Institute for Nuclear Theory in Seattle, 
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and TRIFORM Camphill Community in Hudson, New York. The encouragement of 
colleagues and friends in all of these places is gratefully acknowledged. But this 
long project, often delayed by other physics activities and commitments, could never 
have been completed without the unfailing patient support of my wife, Ann. 


Eugen Merzbacher 
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CHAPTER 1 


Introduction to Quantum Mechanics 


Quantum mechanics is the theoretical framework within which it has been 
found possible to describe, correlate, and predict the behavior of a vast 
range of physical systems, from particles through nuclei, atoms and 
radiation to molecules and condensed matter. This introductory chapter 
sets the stage with a brief review of the historical background and a 
preliminary discussion of some of the essential concepts. 1 


1. Quantum Theory and the Wave Nature of Matter. Matter at the atomic and 
nuclear or microscopic level reveals the existence of a variety of particles which are 
identifiable by their distinct properties, such as mass, charge, spin, and magnetic 
moment. All of these seem to be of a quantum nature in the sense that they take on 
only certain discrete values. This discreteness of physical properties persists when 
particles combine to form nuclei, atoms, and molecules. 

The notion that atoms, molecules, and nuclei possess discrete energy levels is 
one of the basic facts of quantum physics. The experimental evidence for this fact 
is overwhelming and well known. It comes most directly from observations on in- 
elastic collisions (Franck-Hertz experiment) and selective absorption of radiation, 
and somewhat indirectly from the interpretation of spectral lines. 

Consider an object as familiar as the hydrogen atom, which consists of a proton 
and an electron, bound together by forces of electrostatic attraction. The electron 
can be removed from the atom and identified by its charge, mass, and spin. It is 
equally well known that the hydrogen atom can be excited by absorbing certain 
discrete amounts of energy and that it can return the excitation energy by emitting 
light of discrete frequencies. These are empirical facts. 

Niels Bohr discovered that any understanding of the observed discreteness 
requires, above all, the introduction of Planck’s constant , h = 6.6261 X 10 -34 
J sec = 4.136 X 10 15 eV sec. In the early days, this constant was often called the 
quantum of action. By the simple relation 



( 1 . 1 ) 


it links the observed spectral frequency v to the jump A E between discrete energy 
levels. Divided by 2tt, the constant h = hllrr appears as the unit of angular mo- 
mentum, the discrete numbers nh ( n = 0, 1/2, 1, 3/2, 2, . . .) being the only values 
which a component of the angular momentum of a system can assume. All of this 
is true for systems that are composed of several particles, as well as for the particles 
themselves, most of which are no more “elementary” than atoms and nuclei. The 
composite structure of most particles has been unraveled by quantum theoretic 


'Many references to the literature on quantum mechanics are found in the footnotes, and the 
bibliographic information is listed after the Appendix. It is essential to have at hand a current summary 
of the relevant empirical knowledge about systems to which quantum mechanics applies. Among many 
good choices, we mention Haken and Wolf (1993), Christman (1988), Krane (1987), and Perkins (1982). 
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analysis of “spectroscopic” information accumulated in high-energy physics 
experiments. 

Bohr was able to calculate discrete energy levels of an atom by formulating a 
set of quantum conditions to which the canonical variables q { and p t of classical 
mechanics were to be subjected. For our purposes, it is sufficient to remember that 
in this “old quantum theory” the classical phase (or action) integrals for a condi- 
tionally periodic motion were required to be quantized according to 

(j> ptdqi = n t h (1.2) 

where the quantum numbers n, are integers, and each contour integral is taken over 
the full period of the generalized coordinate q t . The quantum conditions (1.2) gave 
good results in calculating the energy levels of simple systems but failed when 
applied to such systems as the helium atom. 

Exercise 1.1. Calculate the quantized energy levels of a linear harmonic os- 
cillator of angular frequency a> in the old quantum theory. 

Exercise 1.2. Assuming that the electron moves in a circular orbit in a Cou- 
lomb field, derive the Balmer formula for the spectrum of hydrogenic atoms from 
the quantum condition (1.2) and the Bohr formula (1.1). 

It is well known that (1.1) played an important role even in the earliest forms 
of quantum theory. Einstein used it to explain the photoelectric effect by inferring 
that light, which through the nineteenth century had been so firmly established as a 
wave phenomenon, can exhibit a particle-like nature and is emitted or absorbed only 
in quanta of energy. Thus, the concept of the photon as a particle with energy E = 
hv emerged. The constant h connects the wave ( v ) and particle ( E ) aspects of light. 

Louis de Broglie proposed that the wave-particle duality is not a monopoly of 
light but is a universal characteristic of nature which becomes evident when the 
magnitude of h cannot be neglected. He thus brought out a second fundamental fact, 
usually referred to as the wave nature of matter. This means that in certain experi- 
ments beams of particles with mass give rise to interference and diffraction phe- 
nomena and exhibit a behavior very similar to that of light. Although such effects 
were first produced with electron beams, they are now commonly observed with 
slow neutrons from a reactor. When incident on a crystal, these behave very much 
like X rays. Heavier objects, such as entire atoms and molecules, have also been 
shown to exhibit wave properties. Although one sometimes speaks of matter waves, 
this term is not intended to convey the impression that the particles themselves are 
oscillating in space. 

From experiments on the interference and diffraction of particles, we infer the 
very simple law that the infinite harmonic plane waves associated with the motion 
of a free particle of momentum p propagate in the direction of motion and that their 
(de Broglie) wavelength is given by 



This relation establishes contact between the wave and the particle pictures. The 
finiteness of Planck’s constant is the basic point here. For if h were zero, then no 
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matter what momentum a particle had, the associated wave would always correspond 
to A = 0 and would follow the laws of classical mechanics, which can be regarded 
as the short wavelength limit of wave mechanics in the same way as geometrical 
optics is the short wavelength limit of wave optics. A free particle would then not 
be diffracted but would go on a straight rectilinear path, just as we expect classically. 

Let us formulate this a bit more precisely. If x is a characteristic length involved 
in describing the motion of a body of momentum p, such as the linear dimension of 
an obstacle in it£ path, the wave aspect of matter will be hidden from our sight, if 


A 

x 



1 


( 1 . 4 ) 


i.e., if the quantum of action h is negligible compared with xp. Macroscopic bodies, 
to which classical mechanics is applicable, satisfy the condition xp » h extremely 
well. To give a numerical example, we note that even as light a body as an atom 
moving with a kinetic energy corresponding to a temperature of T — 10 -6 K still 
has a wavelength no greater than about a micron or 10 -6 m! We thus expect that 
classical mechanics is contained in quantum mechanics as a limiting form (h— >0). 

Indeed, the gradual transition that we can make conceptually as well as prac- 
tically from the atomic level with its quantum laws to the macroscopic level at which 
the classical laws of physics are valid suggests that quantum mechanics must not 
only be consistent with classical physics but should also be capable of yielding the 
classical laws in a suitable approximation. This requirement, which serves as a guide 
in discovering the correct quantum laws, is called the correspondence principle. 
Later we will see that the limiting process which establishes the connection between 
quantum and classical mechanics can be exploited to give a useful approximation 
for quantum mechanical problems (see WKB approximation. Chapter 7). 

We may read (1.3) the other way around and infer that, generally, a wave that 
propagates in an infinite medium has associated with it a particle, or quantum , of 
momentum p = h/\. If a macroscopic wave is to carry an appreciable amount of 
momentum, as a classical electromagnetic or an elastic wave may, there must be 
associated with the wave an enormous number of quanta, each contributing a very 
small amount of momentum. For example, the waves of the electromagnetic field 
are associated with quanta ( photons ) for which the Bohr-Einstein relation E — hv 
holds. Since photons have no mass, their energy and momentum are according to 
relativistic mechanics related by E — cp, in agreement with the connection between 
energy (density) and momentum (density) in Maxwell’s theory of the electromag- 
netic field. Reversing the argument that led to de Broglie’s proposal, we conclude 
that (1.3) is valid for photons as well as for material particles. At macroscopic 
wavelengths, corresponding to microwave or radio frequency, a very large number 
of photons is required to build up a field of macroscopically discernible intensity. 
Such a field can be described in classical terms only if the photons can act coher- 
ently. As will be discussed in detail in Chapter 23, this requirement leads to the 
peculiar conclusion that a state of exactly n photons cannot represent a classical 
field, even if n is arbitrarily large. Evidently, statistical distributions of variable 
numbers of photons must play a fundamental role in the theory. 

The massless quanta corresponding to elastic (e.g., sound) waves are called 
phonons and behave similarly to photons, except that c is now the speed of sound, 
and the waves can be longitudinal as well as transverse. It is important to remember 
that such waves are generated in an elastic medium, and not in free space. 
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2. The Wave Function and Its Meaning. As we have seen, facing us at the outset 
is the fact that matter, say an electron, exhibits both particle and wave aspects. 2 This 
duality was described in deliberately vague language by saying that the de Broglie 
relation “associates” a wavelength with a particle momentum. The vagueness re- 
flects the fact that particle and wave aspects, when they show up in the same thing 
such as the electron, are incompatible with each other unless traditional concepts of 
classical physics are modified to a certain extent. Particle traditionally means an 
object with a definite position in space. Wave means a pattern spread out in space 
and time, and it is characteristic of a wave that it does not define a location or 
position sharply. 

Historically, the need for a reconciliation of the two seemingly contradictory 
concepts of wave and particle was stressed above all by Bohr, whose tireless efforts 
at interpreting the propositions of quantum mechanics culminated in the formulation 
of a doctrine of complementarity. According to this body of thought, a full descrip- 
tion and understanding of natural processes, not only in the realm of atoms but at 
all levels of human experience, cannot be attained without analyzing the comple- 
mentary aspects of the phenomena and of the means by which the phenomena are 
observed. Although this epistemological view of the relationship between classical 
and quantal physics is no longer central to the interpretation of quantum mechanics, 
an appreciation of Bohr’s program is important because, through stimulation and 
provocation, it has greatly influenced our attitude toward the entire subject. 3 

How a synthesis of the wave and particle concepts might be achieved can, for 
a start, perhaps be understood if we recall that the quantum theory must give an 
account of the discreteness of certain physical properties, e.g., energy levels in an 
atom or a nucleus. Yet discreteness did not first enter physics with the Bohr atom. 
In classical macroscopic physics discrete, “quantized,” physical quantities appear 
naturally as the frequencies of vibrating bodies of finite extension, such as strings, 
membranes, or air columns. We speak typically of the (natural) modes of such sys- 
tems. These phenomena have found their simple explanation in terms of interference 
between incident and reflected waves. Mathematically, the discrete behavior is en- 
forced by boundary conditions: the fixed ends of the string, the clamping of the 
membrane rim, the size and shape of the enclosure that confines the air column. 
Similarly, it is tempting to see in the discrete properties of atoms the manifestations 
of bounded wave motion and to connect the discrete energy levels with standing 
waves. In such a picture, the bounded wave must somehow be related to the con- 
finement of the particle to its “orbit,” but it is obvious that the concept of an orbit 
as a trajectory covered with definite speed cannot be maintained. 

A wave is generally described by its velocity of propagation, wavelength, and 
amplitude. (There is also the phase constant of a wave, but, as we shall see later, 
for one particle this is undetermined.) Since in a standing wave it is the wavelength 
(or frequency) that assumes discrete values, it is evident that if our analogy is mean- 
ingful at all, there must be a correspondence between the energy of an atom and the 


2 It will be convenient to use the generic term electron frequently when we wish to place equal 
emphasis on the particle and wave aspects of a constituent of matter. The electron has been chosen 
only for definiteness of expression (and historical reasons). Quantum mechanics applies equally to 
protons, neutrons, mesons, quarks, and so on. 

3 For a compilation of original articles on the foundations of quantum mechanics and an extensive 
bibliography, see Wheeler and Zurek (1985). Also see the resource letters in the American Journal of 
Physics: DeWitt and Graham (1971), and L. E. Ballentine (1987). 
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wavelength of the wave associated with the particle motion. For a free particle, one 
that is not bound in an atom, the de Broglie formula (1.3) has already given us a 
relationship connecting wavelength with energy (or momentum). The connection 
between wavelength and the mechanical quantities, momentum or energy, is likely 
to be much more complicated for an electron bound to a nucleus as in the hydrogen 
atom, or for a particle moving in any kind of a potential. Erwin Schrodinger dis- 
covered the wave equation that enables us to evaluate the “proper frequencies” or 
eigenfrequencie s s of general quantum mechanical systems. 

The amplitudes or wave fields, which, with their space and time derivatives, 
appear in the Schrodinger equation, may or may not have directional (i.e., polariza- 
tion) properties. We will see in Chapter 16 that the spin of the particles corresponds 
to the polarization of the waves. However, for many purposes the dynamical effects 
of the spin are negligible in first approximation, especially if the particles move with 
nonrelativistic velocities and are not exposed to magnetic fields. We will neglect the 
spin for the time being, much as in a simple theory of wave optical phenomena, 
where we are concerned with interference and diffraction or with the geometrical 
optics limit, the transverse nature of light can often be neglected. Hence, we attempt 
to build up quantum mechanics with mass (different from zero) first by use of scalar 
waves. For particles with zero spin, for example, pions and K mesons, this gives an 
appropriate description. For particles with nonzero spin, such as electrons, quarks, 
nucleons, or muons, suitable corrections must be made later. We will also see that 
the spin has profound influence on the behavior of systems comprised of several, or 
many, identical particles. 

Mathematically, the scalar waves are represented by a function ifj(x, y, z, t), 
which in colorless terminology is called the wave function. Upon its introduction 
we immediately ask such questions as these: Is if/ a measurable quantity, and what 
precisely does it describe? In particular, what feature of the particle aspect of the 
particle is related to the wave function? 

We cannot expect entirely satisfactory answers to these questions before we 
have become familiar with the properties of these waves and with the way in which 
iff is used in calculations, but the questions can be placed in sharper focus by reex- 
amining the de Broglie relation (1.3) between wavelength, or the wave number k = 
2vl\, and particle momentum: 


,, h 2t t 

p = m = ^t 


(1.5) 


Suppose that a beam of particles having momentum p in the x direction is viewed 
from a frame of reference that moves uniformly with velocity v, along the x axis. 
For nonrelativistic velocities of the particles and for u«c, the usual Galilean trans- 
formation 

x' ~ x — vt (1.6) 


changes the particle momentum to 


p' = p — mv (1.7) 

If the particles are in free space, we must assume that the de Broglie relation is valid 
also in the new frame of reference and that therefore 




* A 


( 1 . 8 ) 
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We can easily imagine an experimental test of this relation by measuring the spacing 
of the fringes in a two-slit Young-type interference apparatus, which in its entirety 
moves at velocity v parallel to the beam. Of the outcome of such a test there can 
hardly be any doubt: The fringe pattern will broaden, corresponding to the increased 
wavelength. 

When classical elastic waves, which propagate in the “rest” frame of the me- 
dium with speed V are viewed from the “moving” frame of reference, their phase, 
<f) = kx - o)t - 2ir{x - Vt)l\, as a measure of the number of amplitude peaks and 
valleys within a given distance, is Galilean-invariant. The transformation (1.6) gives 
the connection 


2tt 

<f> = <f) ' — —— ( X ' + vt 

A 


Vt) = Y (*' 


V't) 


from which we deduce the familiar Doppler shift 



and the unsurprising result: 

A' = A 


(1.9) 


( 1 . 10 ) 

( 1 . 11 ) 


Although the invariance of the wavelength accords with our experience with elastic 
waves, it is in stark conflict with the conclusion (1.8) for de Broglie waves, the ip 
waves of quantum mechanics. What has gone awry? 

Two explanations come to mind to resolve this puzzle. In later chapters we will 
see that both are valid and that they are mutually consistent. Here, the main lesson 
to be learned is that the ip waves are unlike classical elastic waves, whose amplitude 
is in principle observable and which are therefore unchanged under the Galilean 
transformation. Instead, we must entertain the possibility that, under a Galilean 
transformation, tp changes into a transformed wave function, ip', and we must as- 
certain the transformation law for de Broglie waves. If ip cannot be a directly mea- 
surable amplitude, there is no compelling reason for it to be a real-valued function. 
We will see in Section 4.7 that by allowing ip to be complex-valued for the descrip- 
tion of free particles with momentum p, the conflict between Eqs. (1.8) and (1.11) 
can be resolved. A local gauge transformation, induced by the Galilean transfor- 
mation (1.6), will then be found to provide a new transformation rule for the phase 
of the waves, replacing (1.11) and restoring consistency with the correct quantum 
relation (1.8). 

An alternative, and in the final analysis equivalent, way to avoid the contradic- 
tion implied by Eqs. (1.8) and (1.11) is to realize that Lorentz, rather than Galilei, 
transformations may be required in spite of the assumed subluminal particle and 
frame-of-reference velocities. If the Lorentz transformation 


x' = y(x — vt) 


and 





( 1 . 12 ) 


is applied, and if Lorentz invariance (instead of Galilean invariance) of the phase 
of the ip waves is assumed, the frequency and wave number must transform relativ- 
istically as 



and o' = y(a> — vk) 


(1.13) 
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For low relative velocities (y “ 1) the second of these equations again gives the 
first-order Doppler frequency shift. However, the transformation law for the wave 
number contains a relativistic term, which was tacitly assumed to be negligible in 
the nonrelativistic regime. This relation becomes consistent with (1.5) and the non- 
relativistic equations (1.7) only if it is assumed that the frequency co of de Broglie 
waves, of which we have no direct experimental information, is related to the particle 
mass by 

ha) ~ me 2 (1-14) 

This result is eminently reasonable in a relativistic quantum mechanics. It implies 
that for nonrelativistic particles the phase velocity of the waves is 



which greatly exceeds the speed of light and which explains the need for Lorentz 
transformations under all circumstances. However, since, except for two final chap- 
ters on fully relativistic quantum mechanics for photons and electrons, our treatment 
is intended to center on nonrelativistic quantum mechanics, we will retain Galilean 
transformations and acknowledge the need to transform ip appropriately. (See Sec- 
tion 4.7 for a more detailed discussion.) 

Exercise 1.3. Compare the behavior of de Broglie waves for particles of mass 
m with the changes that the wavelength and frequency of light undergo as we look 
at a plane electromagnetic wave from a “moving” frame of reference. 


As we progress through quantum mechanics, we will become accustomed to ip 
as an important addition to our arsenal of physical concepts, in spite of its unusual 
transformation properties. If its physical significance remains as yet somewhat ob- 
scure to us, one thing seems certain: The wave function ip must in some sense be a 
measure of the presence of a particle. Thus, we do not expect to find the particle in 
those regions of space where ip = 0. Conversely, in regions of space where the 
particle may be found, i p must be different from zero. But the function ip(x, y, z, t) 
itself cannot be a direct measure of the likelihood of finding the particle at position 
x, y, z at time t. For if it were that, it would have to have a nonnegative value 
everywhere. Yet, it is impossible for ip to be nonnegative everywhere, if destructive 
interference of the ip waves is to account for the observed dark interference fringes 
and for the instability of any but the distinguished Bohr orbits in a hydrogen atom. 

In physical optics, interference patterns are produced by the superposition of 
waves of E and B, but the intensity of the fringes is measured by E 2 and B 2 . In 
analogy to this situation, we assume that the positive quantity | tp(x, y, z, t) | 2 mea- 
sures the probability of finding the particle at position x, y, z at time t. (The absolute 
value has been taken because it will turn out that ip can have complex values.) The 
full meaning of this interpretation of ip and its internal consistency will be discussed 
in detail in Chapter 3. Here we merely want to advance some general qualitative 
arguments for this so-called probability interpretation of the quantum wave function 
for particles with mass, which was introduced into quantum mechanics by Max Born. 

From all that is known to date, it is consistent with experiment and theory to 
associate the wave ip with a single particle or atom or other quantum system as a 
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representative of a statistical ensemble. Owing to their characteristic properties, such 
as charge and mass, particles can be identified singly in the detection devices of 
experimental physics. With the aid of these tools, it has been abundantly established 
that the interference fringes shown schematically in Figure 1.1 are the statistical 
result of the effect of a very large number of independent particles hitting the screen. 
The interference pattern evolves only after many particles have been deposited on 
the detection screen. Note that the appearance of the interference effects does not 
require that a whole beam of particles go through the slits at one time. In fact, 
particles can actually be accelerated and observed singly, and the interference pattern 
can be produced over a length of time, a particle hitting the screen now here, now 
there, in seemingly random fashion. When many particles have come through, a 
regular interference pattern will be seen to have formed. The conclusion is almost 
inevitable that i jj describes the behavior of single particles, but that it has an intrinsic 
probabilistic meaning. The quantity | ip\ 2 would appear to measure the chance of 
finding the particle at a certain place. In a sense, this conclusion was already implicit 
in our earlier discussion regarding an infinite plane wave as representative of a free 
particle with definite momentum (wavelength) but completely indefinite position. At 
least if i/f is so interpreted, the observations can be correlated effortlessly with the 
mathematical formalism. These considerations motivate the more descriptive name 
probability amplitude for the wave function y, z, t). As a word of caution, we 
note that the term amplitude, as used in quantum mechanics, generally refers to the 
spacetime-dependent wave function ij/, and not merely to its extreme value, as is 
customary in speaking about elastic or electromagnetic waves. 

The indeterminism that the probabilistic view ascribes to nature, and that still 
engenders discomfort in some quarters, can be illustrated by the idealized experiment 
shown in Figure 1.1. Single particles are subject to wave interference effects, and 



/(r, t) 


Figure 1.1, Schematic diagram of the geometry in a two-slit experiment. A plane wave, 
depicted by surfaces of equal phase, is incident with wavelength A = 2-7 rlk from the left 
on the narrow slits A and B. The amplitude and intensity at the spacetime point P(r,t), 
at a distance s, = AP and s 2 = BP from the slits, depends on the phase difference 
S(r,t) = k(s^ — s 2 ) = k (AC). A section of the intensity profile I(r,t) is shown, but all 
scales on this figure are distorted for emphasis. Bright fringes appear at P if ~ s 2 \ 
equals an integral multiple of the wavelength. 
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some are found deposited on the screen at locations that they could not reach if they 
moved along classical paths through either slit. The appearance of the interference 
fringes depends on the passage of the wave through both slits at once, just as a light 
wave goes through both slits in the analogous optical Young interference experi- 
ment. If the wave describes the behavior of a single particle, it follows that we cannot 
decide through which of the two slits the particle has gone. If we try to avoid this 
consequence by determining experimentally with some subtle monitoring device 
through which S slit the particle has passed, we change the wave drastically and de- 
stroy the interference pattern. A single particle now goes definitely through one slit 
or the other, and the accumulation of a large number of particles on the screen will 
result in two well- separated traces. Exactly the same traces are obtained if we block 
one slit at a time, thereby predetermining the path of a particle. In the language of 
the principle of complementarity, the conditions under which the interference pattern 
is produced forbid a determination of the slit through which the particle passes. 

This impressionistic qualitative description will be put on a firmer footing in 
Chapters 9 and 10, and again in Chapter 23, but the basic feature should be clear 
from the present example: Wave aspect and particle aspect in one and the same thing 
are compatible only if we forego asking questions that have no meaning (such as: 
“Let us see the interference fringes produced by particles whose paths through an 
arrangement of slits we have followed ! ” ) 

An alternative view of the probability interpretation maintains that the wave ip 
describes only the statistical behavior of a collection or ensemble of particles, al- 
lowing, at least in principle, for a more detailed and deterministic account of the 
motion of the single systems that make up the ensemble than is sanctioned by quan- 
tum mechanics. For example, in the two-slit experiment some hidden property of 
the individual particles would then presumably be responsible for the particular 
trajectories that the particles follow. Generally, there would be the potential for a 
more refined description of the individual member systems of the ensemble. From 
the confrontation between experiments on an important class of quantum systems 
and a penetrating theoretical analysis that is based on minimal assumptions (first 
undertaken by John Bell), we know that such a more “complete” description— 
broadly referred to as realistic or ontological — cannot succeed unless it allows for 
something akin to action-at-a-distance between systems that are widely separated in 
space. On the other hand, such nonlocal features arise naturally in quantum me- 
chanics, and no evidence for any underlying hidden variables has ever been found. 
Thus, it is reasonable to suppose that ip fully describes the statistical behavior of 
single systems. 

To summarize, the single-particle probability doctrine of quantum mechanics 
asserts that the indetermination of which we have just given an example is a property 
inherent in nature and not merely a profession of our temporary ignorance from 
which we expect to be relieved by a future and more complete description. This 
interpretation thus denies the possibility of a more “complete” theory that would 
encompass the innumerable experimentally verified predictions of quantum mechan- 
ics but would be free of its supposed defects, the most notorious “imperfection” of 
quantum mechanics being the abandonment of strict classical determinism. Since 
the propositions of quantum mechanics are couched in terms of probabilities, the 
acquisition, processing, and evaluation of information are inherent in the theoretical 
description of physical processes. From this point of view, quantum physics can be 
said to provide a more comprehensive and complete account of the world than that 
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ispired to by classical physics. Loose talk about these issues may lead to the im- 
pression that quantum mechanics must forsake the classical goal of a wholly rational 
lescription of physical processes. Nothing could be further from the truth, as this 
pook hopes to demonstrate . 4,5 


4 The following list is a sample of books that will contribute to an understanding of quantum 
nechanics, including some by the pioneers of the field: 

E, Ballentine, Quantum Mechanics, Prentice Hall, Englewood Cliffs, N.J., 1990. 

V Bohm, Quantum Mechanics, 2nd ed.. Springer- Verlag, Berlin, 1986. 

■Jiels Bohr, Atomic Physics and Human Knowledge, John Wiley, New York, 1958. 

*. A. M. Dirac, The Principles of Quantum Mechanics, 4th ed., Clarendon Press, Oxford, 1958. 

Curt Gottfried, Quantum Mechanics, Volume I, W. A. Benjamin, New York, 1966. 

I. S. Green, Matrix Mechanics, Noordhoff, Groningen, 1965. 

Valter Greiner and Berndt Muller, Quantum Mechanics, Symmetries, Springer- Verlag, Berlin, 1989. 
Verner Heisenberg, The Physical Principles of the Quantum Theory, University of Chicago Press, 
1930, translated by C. Eckart and C. Hoyt, Dover reprint, 1949. 
larry Holstein, Topics in Advanced Quantum Mechanics, Addison-Wesley, Reading, Mass., 1992. 

Tax Jammer, The Conceptual Development of Quantum Mechanics, McGraw-Hill, New York, 1966. 

J. A. Kramers, Quantum Mechanics, Interscience, New York, 1957. 

D. Landau and E. M. Lifshitz, Quantum Mechanics, Addison-Wesley, Reading, Mass., 1958, 
translated by J. B. Sykes and J. S. Bell, 
lubin H. Landau, Quantum Mechanics II, John Wiley, New York, 1990. 

t.. Messiah, Quantum Mechanics, North-Holland, Amsterdam, 1961 and 1962, Vol. I translated by 
G. Temmer, Vol. II translated by J. Potter. 

toland Omnes, The Interpretation of Quantum Mechanics, Princeton University Press, 1994. 

Volfgang Pauli, Die allgemeinen Prinzipien der Wellenmechanik, Vol. 5/1 of Encyclopedia of 
Physics, pp. 1-168, Springer-Verlag, Berlin, 1958. 

. J. Sakurai, with San Fu Tuan, ed.. Modern Quantum Mechanics, revised ed., Benjamin/Cummings, 
New York, 1994. 

!. L. van der Waerden, Sources of Quantum Mechanics, North-Holland, Amsterdam, 1967. 

. M. Ziman, Elements of Advanced Quantum Theory, Cambridge University Press, 1969. 

5 Books suitable for introductory study of quantum mechanics include: 

llaude Cohen-Tannoudji, Bernard Diu, and Frank Laloe, Quantum Mechanics, Volumes I and II, 

John Wiley, New York, 1977. 

1. H. Dicke and J. P. Wittke, Introduction to Quantum Mechanics, Addison-Wesley, Reading, Mass., 
1960. 

ames M. Feagin, Quantum Mechanics with Mathematica, Springer-Verlag, 1993. 
tephen Gasiorowicz, Quantum Physics, 2nd ed., John Wiley, New York, 1996. 

Valter Greiner, Quantum Mechanics, An Introduction, Springer-Verlag, Berlin, 1989. 

). Griffiths, Introduction to Quantum Mechanics, Prentice-Hall, Englewood Cliffs, N.J., 1995. 

'homas F. Jordan, Quantum Mechanics in Simple Matrix Form, John Wiley, New York, 1986. 
lichael Morrison, Understanding Quantum Mechanics . Prentice-Hall, Englewood, N.J., 1990. 

)avid A. Park, Introduction to the Quantum Theory, 3rd ed., McGraw-Hill, New York, 1992. 

. L. Powell and B. Crasemann, Quantum Mechanics, Addison-Wesley, Reading, Mass., 1961. 

L. Shankar, Principles of Quantum Mechanics, Plenum, New York, 1980. 
lichard W. Robinett, Quantum Mechanics, Classical Results, Modern Systems, and Visualized 
Examples, Oxford University Press, 1997. 

ohn S. Townsend, A Modern Approach to Quantum Mechanics, McGraw-Hill, New York, 1992. 
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Problems 

1. To what velocity would an electron (neutron) have to be slowed down, if its wave- 
length is to be 1 meter? Are matter waves of macroscopic dimensions a real 
possibility? 

2. For the observation of quantum mechanical Bose-Einstein condensation, the inter- 
particle distance in a gas of noninteracting atoms must be comparable to the de Brog- 
lie wavelength, or less. How high a particle density is needed to achieve these con- 
ditions if the atoms have mass number A = 100 and are at a temperature of 100 
nanokelvin? 



CHAPTER 2 


Wave Packets , Free Particle Motion, 
and the Wave Equation 

Building on previous experience with wave motion and simple Fourier 
analysis, we develop the quantum mechanical description of the motion of 
free particles. The correspondence between quantum and classical motion 
serves as a guide in the construction, by superposition of harmonic waves, 
of wave packets that propagate like classical particles but exhibit quantum 
mechanical spreading in space and time. Heisenberg’s uncertainty 
relations and the Schrodinger wave equation make their first appearance. 

1. The Principle of Superposition. We have learned that it is reasonable to sup- 
pose that a free particle of momentum p is associated with a harmonic plane wave. 
Defining a vector k which points in the direction of wave propagation and has the 
magnitude 



we may write the fundamental de Broglie relation as 

p = ftk (2.2) 

The symbol h is common in quantum physics and denotes the frequently recurring 
constant 

h = — = 1.0545 X 10~ 34 J sec = 6.5819 X 10 -22 MeV sec = 0.66 X 1CT 15 eV sec 

2tt 

It is true that diffraction experiments do not give us any direct information about 
he detailed dependence on space and time of the periodic disturbance that produces 
he alternately “bright” and “dark” fringes, but all the evidence points to the cor- 
•ectness of the simple inferences embodied in (2.1) and (2.2). The comparison with 
optical interference suggests that the fringes come about by linear superposition of 
wo waves, a point of view that has already been stressed in Section 1.2 in the 
liscussion of the simple two-slit interference experiment (Figure 1.1). 

Mathematically, these ideas are formulated in the following fundamental as- 
sumption about the wave function ij/(x, y, z, t): If i/q(x, y, z, t ) and ij/ 2 (x, y , z, t) 
lescribe two waves, their sum i/j(x, y, z, t) = if/ 1 + i jj 2 also describes a possible 
physical situation. This assumption is known as the principle of superposition and 
s illustrated by the interference experiment of Figure 1.1. The intensity produced 
)n the screen by opening only one slit at a time is |^i| 2 or |i/r 2 | 2 - When both slits 
ire open, the intensity is determined by | fa + <p 2 \ 2 . This differs from the sum of 
he two intensities, |^i| 2 + |i// 2 | 2 , by the interference terms + fatyi*- (An 

isterisk will denote complex conjugation throughout this book.) 

A careful analysis of the interference experiment would require detailed con- 
lideration of the boundary conditions at the slits. This is similar to the situation 
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found in wave optics, where the uncritical use of Huygens’ principle must be jus- 
tified by recourse to the wave equation and to Kirchhoff’s approximation. There is 
no need here for such a thorough treatment, because our purpose in describing the 
idealized two-slit experiment was merely to show how the principle of superposition 
accounts for some typical interference and diffraction phenomena. Such phenomena, 
when actually observed as in diffraction of particles by crystals, are impressive direct 
manifestations of the wave nature of matter. 

We therefore adopt the principle of superposition to guide us in developing 
quantum mechanics. The simplest type of wave motion to which it will be applied 
is an infinite harmonic plane wave propagating in the positive x direction with wave- 
length A = 27 r/k and frequency co. Such a wave is associated with the motion of a 
free particle moving in the x direction with momentum p = hk, and its most general 
form is 

if/fx, t) = cos(fox — (ot) + 5 sin(fox — cot) (2.3) 

A plane wave moving in the negative x direction would be written as 

if/ 2 (x, t) — cos(— foe — cot) + 8 sin(— fox — cot) 

— cos (fox + cot ) — 5 sin(fox + cot) 

The coordinates y and z can obviously be omitted in describing these one-dimen- 
sional processes. 

An arbitrary displacement of x or t should not alter the physical character of 
these waves, which describe, respectively, a particle moving uniformly in the pos- 
itive and negative x directions, nor should the phase constants of these waves have 
any physical significance. Hence, it must be required that 

cos(fcc — cot + s) + 8 sin(fcc — cot + e) = a(e)[cos(fox — cot) + 8 sin(fox — <ut)] 

for all values of x, t, and e. Comparing coefficients of cos(fox — cot) and of 
sin(fox - cot), we find that this last equation leads to 

cos e + 8 sin e = a and 8 cos e — sin s = a.8 

These equations are compatible for all e only if 8 2 = — 1 or 8 = ±i. We choose the 
solution 8 = i, a = e ,E and are thus led to the conclusion that if/ waves describing 
free particle motion must in general be complex. We have no physical reason for 
rejecting complex-valued wave functions because, unlike elastic displacements or 
electric field vectors, the if/ waves of quantum mechanics cannot be observed di- 
rectly, and the observed diffraction pattern presumably measures only the intensity 

W- 

Summarizing our conclusions, we see that in this scheme an infinite harmonic 
plane wave propagating toward increasing x is 

<A,(x, t) = Ae i(kx - W,) (2.5) 

and a wave propagating toward decreasing x is 

ip 2 (x, t) = Be~ i<kx+at) = Be*-*-"* (2.6) 

The initial values of these waves are 

if/ 1 (x, 0) = Ae ,kx and if/ 2 (x, 0) = Be~ ikx 

respectively. The (circular) frequency of oscillation is co, which thus far we have 
not brought into connection with any physically observable phenomenon. Generally, 
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t will be a function of k (see Section 2.3). We will refer to wave functions like (2.5) 
nd (2.6) briefly as plane waves. 

A plane wave propagating in an arbitrary direction has the form 

if/(r, t ) = if/(x, y, z, t) = Ae i(k - r - Mt) = Ae i( ^ + V+^-^ (2.7) 

Equations (2.5) and (2.6) are special cases of this with k y = k z = 0 and k x — ±k. 

It should be stressed that with the acceptance of complex values for if/, we are 
y no means excluding wave functions that are real-valued, at least at certain times, 
'or example, if the initial wave function is if/(x, 0) = cos kx, this can be written as 
le sum of two exponential functions: 

if/(x, 0) = cos kx = - e ,kx + - e~ lkx (2.8) 

according to the principle of superposition, this must be an acceptable wave func- 
Lon. How does it develop in time? The principle of superposition suggests a simple 
nswer to this important question: Each of the two or more waves, into which if/ can 
e decomposed at t = 0, develops independently, as if the other component(s) were 
ot present. The wave function, which at t = 0 satisfies the initial condition i f/(x, 0) 
= cos kx, thus becomes for arbitrary t: 

i f/(x, t) = - e i(kx ~ Mt) + - e ~ Kkx+wt) (2.9) 

2 2 

iote that this is not the same as cos(kx — cot). This rule, to which we shall adhere 
nd which we shall generalize, ensures that i f/(x, 0) determines the future behavior 
f the wave uniquely. If this formulation is correct, we expect that the complex if/ 
mves may be described by a linear differential equation which is of the first order 
l time. 1 

. Wave Packets and the Uncertainty Relations. The foregoing discussion points 
) the possibility that by allowing the wave function to be complex, we might be 
ble to describe the state of motion of a particle at time t completely by if/(r, t ). The 
jal test of this assumption is, of course, its success in accounting for experimental 
bservations. Strong support for it can be gained by demonstrating that the corre- 
pondence with classical mechanics can be established within the framework of this 
leory. 

To this end, we must find a way of making if/(r, t ) describe, at least approxi- 
lately, the classical motion of a particle that has both reasonably definite position 
nd reasonably definite momentum. The plane wave (2.7) corresponds to particle 
lotion with momentum, which is precisely defined by (2.2); but having amplitudes 
p\ = const, for all r and t, the infinite harmonic plane waves (2.7) leave the position 
f the particle entirely unspecified. By superposition of several different plane 
'aves, a certain degree of localization can be achieved, as the fringes on the screen 
l the diffraction experiment attest. 


'Maxwell’s equations are of the first order even though the functions are real, but this is accom- 
lished using both E and B in coupled equations, instead of describing the field by just one function, 
he heat flow equation is also of the first order in time and describes the behavior of a real quantity, 
le temperature; but none of its solutions represent waves propagating without damping in one direction 
ke (2.7). 
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The mathematical tools for such a synthesis of localized compact wave packets 
by the superposition of plane waves of different wave number are available in the 
form of Fourier analysis. Assuming for simplicity that only one spatial coordinate, 
x, need be considered, we can write 


&- 


0 ) 



r. 


4>(K)e ik ^ 


dk x 


and the inverse formula 


( 2 . 10 ) 



( 2 . 11 ) 


under very general conditions to be satisfied by ip(x, 0). 2 Formulas (2.10) and (2.1 1) 
show that (f) ( k x ) determines the initial wave function ip(x, 0) and vice versa. 

We now assume that 4>(k x ) is a real positive function and that it has a shape 
similar to Figure 2.1, i.e., a symmetric distribution of k x about a mean value k x . 
Making the change of variable 



we may write 


^ 0) V2^ 

This is a wave packet whose absolute value is shown in Figure 2.2. The particle is 
most likely to be found at a position where ip is appreciable. It is easy to see for 
any number of simple examples that the width A k x of the amplitude (p and the width 
Ax of the wave packet ip stand in a reciprocal relationship: 

Ax Ak x = 1 (2.13) 

For a proof of the uncertainty relation (2.13), note that the integral in (2.12) is an 
even real function of x. Let us denote by (— x 0 , x 0 ) the range of x for which ip is 
appreciably different from zero (see Figure 2.2). Since <p is appreciably different 
from zero only in a range A k x centered at u — 0, where k x — k x , the phase of e ,ux in 
the integrand varies from — x Ak x /2 to + x £sk x l2, i.e., by an amount x A k x for any 
fixed value of x. If x 0 A k x is less than =1, no appreciable cancellations in the inte- 
grand occur (since (p is positive definite). On the other hand, when x 0 A k x » 1, the 
phase goes through many periods as u ranges from — A^/2 to +A&*/2 and violent 
oscillations of the e lux term occur, leading to destructive interference. Hence, the 
largest variations that the phase ever undergoes are ±x 0 A k x ; this happens at the 
ends of the wave packet. Denoting by Ax the range (-x 0 , x 0 ), it follows that the 
widths of | ip | and <p are effectively related by (2.13). 




• 4 - 

J ~ c 


(p(u + k x )e iux du 


(2.12) 


2 For a review of Fourier analysis, see any text in mathematical physics, e.g., Bradbury (1984), 
Arfken (1985), and Hassani (1991). For Eq. (2.11) to be the inverse of (2.10), it is sufficient that 
0) be a function that is sectionally continuous. The basic formulas are summarized in Section 1 
of the Appendix. 
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4>(k x ) 



figure 2.1. Example of a one-dimensional momentum distribution. The function <f>(k x ) = 
\fail [cosh a ( k x — k 0 )] _l represents a wave packet moving in the positive x direction with 
nean wave number k 0 = 3. The function <fi(k x ) is normalized so that Ji°°| </>(£*) | 2 dk x — 1. 
The two functions correspond to a = 0.72 and V 7 r (heavy line). 





figure 2.2. Normalized wave packet corresponding to the momentum distributions of 


figure 2.1. The plotted amplitude, | ip(x) \ = 


7 TX 


cosh — ) , modulates the plane wave 


.<*6* N ote the reciprocity of the widths of the corresponding curves in Figures 2. 1 and 2.2. 


Exercise 2.1. Assume 4>(k x ) = V27T for k x — 8 < k x ^ k x + 8, and 4>(k x ) = 
) for all other values of k x . Calculate 1 (f(x, 0), plot 1 1 f/(x, 0) | 2 for several values of 
), and show that (2.13) holds if Ax is taken as the width at half maximum. 3 

Exercise 2.2. Assume t/^(jc, 0) = for —00 < x < + 00 . Calculate 4>(k x ) 
ind show that the widths of if/ and (f), reasonably defined, satisfy the reciprocal 
elation (2.13). 


3 Note that if (Ax) 2 is taken as the variance of x, as in Section 10.5, its value will be infinite for 
he wave packets defined in Exercises 2.1 and 2.3. 
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The initial wave function if/(x, 0) thus describes a particle that is localized 
within a distance Ax about the coordinate origin. This has been accomplished at the 
expense of combining waves of wave numbers in a range t±k x about k x . The relation 
(2.13) shows that we can make the wave packet define a position more sharply only 
at the cost of broadening the spectrum of ^-values which must be admitted. Any 
hope that these consequences of (2.13) might be averted by choosing <£(£*) more 
providentially, perhaps by making it asymmetric or by noting that in general it is 
complex-valued, has no basis in fact. On the contrary, we easily see (and we will 
rigorously prove in Chapter 10) that in general 

Ax A k x > (2.14) 

The product Ax A k x assumes a value near its minimum of 1/2 only if the absolute 

value of <f> behaves as in Figure 2.1 and its phase is either constant or a linear 

function of k x . 

Exercise 2.3. Assume 

8 8 
cf>(k x ) = V2tt for k x - - < k x < k x + - 

— —8 — 8 — 

= — V277 for k x — 8 < k x < k x — — and k x + - < k x < k x + 8 

A 

and cf) — 0 for all other values of k x . Calculate if/(x, 0) and show that, although the 
width of the /^-distribution is the same as in Exercise 2.1, the width of if/(x, 0) is 
greater in accordance with the inequality (2.14). 

The fact that in quantum physics both waves and particles appear in the de- 
scription of the same thing has already forced us to abandon the classical notion 
that position and momentum can be defined with perfect precision simultaneously. 
Equation (2.14) together with the equation p x — hk x expresses this characteristic 
property of quantum mechanics quantitatively: 

Ax A Px >^ (2.15) 

This uncertainty relation suggests that when if/ is the sum of plane waves with 
different values of k x , the wave function cannot represent a particle of definite mo- 
mentum (for if it did if/ would have to be a harmonic plane wave). It also suggests 
that its spread-out momentum distribution is roughly pictured by the behavior of 
1 4>\ 2 . If | <f)\ 2 has its greatest magnitude at k x , the particle is most likely to have the 
momentum hk x . Obviously, we will have to make all these statements precise and 
quantitative as we develop the subject. 

Exercise 2.4. By choosing reasonable numerical values for the mass and ve- 
locity, convince yourself that (2.15) does not in practice impose any limitations on 
the precision with which the position and momentum of a macroscopic body can be 
determined. 

For the present, it suffices to view a peak in if/ as a crudely localized particle 
and a peak in (f> as a particle moving with an approximately defined velocity. The 
uncertainty relation (2.15) limits the precision with which position and momentum 
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can be simultaneously ascribed to the particle. Generally, both quantities are fuzzy 
and indeterminate ( Heisenberg uncertainty principle). 

We have already discussed in Chapter 1 what this means in terms of experi- 
ments. The function \(p\ 2 is proportional to the probability of finding the particle at 
position x. Yet upon measurement the particle will always be found to have a definite 
position. Similarly, quantum mechanics does not deny that precision measurements 
of momentum are feasible even when the particle is not represented by a plane wave 
of sharp momentum. Rather, \(f>\ 2 is proportional to the probability of finding the 
particle to have momentum hk x . 

Under these circumstances, must we admit that the particle described by (or 
i f/) has no definite momentum (or position) when we merely are unable to determine 
a single value consistently in making the same measurement on identically prepared 
systems? Could the statistical uncertainties for the individual systems be reduced 
below their quantum mechanical values by a greater effort? Is there room in the 
theory for supplementing the statistical quantum description by the specification of 
further (“hidden”) variables, so that two systems that are in the same quantal state 
(if/ or cf>) may be found to be distinct in a more refined characterization? 

These intriguing questions have been hotly debated since the advent of quantum 
mechanics. 4 We now know ( Bell’s theorem ) that the most natural kinds of hidden- 
variable descriptions are incompatible with some of the subtle predictions of quan- 
tum mechanics. Since these predictions have been borne out experimentally to high 
accuracy, we adopt as the central premise of quantum mechanics that 

For any given state (if/ or <f>) the measurement of a particular physical quan- 
tity results with calculable probability in a numerical value belonging to a 

range of possible measured values. 

No technical or mathematical ingenuity can presumably devise the means of 
giving a sharper and more accurate account of the physical state of a single system 
than that permitted by the wave function and the uncertainty relation. These claims 
constitute a principle which by its very nature cannot be proved, but which is sup- 
ported by the enormous number of verified consequences derived from it. 

Bohr and Heisenberg were the first to show in detail, in a number of interesting 
thought experiments, how the finite value of ft in the uncertainty relation makes the 
coexistence of wave and particle both possible and necessary. These idealized ex- 
periments demonstrate explicitly how any effort to design a measurement of the 
momentum component p x with a precision set by A p x inescapably limits the precision 
of a simultaneous measurement of the coordinate x to Ax > ft/2 A p x , and vice versa. 

Illuminating as Bohr’s thought experiments are, they merely illustrate the im- 
portant discovery that underlies quantum mechanics, namely, that the behavior of a 
material particle is described completely by its wave function if/ or (f> (suitably mod- 
ified to include the spin and other degrees of freedom, if necessary), but generally 
not by its precise momentum or position. Quantum mechanics contends that the wave 
function contains the maximum amount of information that nature allows us con- 
cerning the behavior of electrons, photons, protons, neutrons, quarks, and the like. 
Broadly, these tenets are referred to as the Copenhagen interpretation of quantum 
mechanics. 

3. Motion of a Wave Packet. Having placed the wave function in the center of 
our considerations, we must attempt to infer as much as possible about its properties. 


4 See Wheeler and Zurek (1985). 
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In three dimensions an initial wave packet if/(r, 0) is represented by the Fouriei 
integral: 5 

z, 0) = <A(r, 0) = j <p (k)e' kr J 3 A: (2.16) 

and the inverse formula 


s. 


<K k) - 


(2tt) 


. 3/2 


s 


0)e~ ik r d*r 


(2-17) 


If the particle which the initial wave function (2.16) describes is free, then by 
the rule of Section 2.1 (the superposition principle) each component plane wave 
contained in (2.16) propagates independently of all the others according to the pre- 
scription of (2.7). The wave function at time t becomes 


<A(r, t) 


(2t r) 


3/2 


j 


j'(k-r— <of) d 3 k 


(2.18) 


Formula (2.18) is, of course, incomplete until we determine and specify the depen- 
dence of oj on k. This determination will be made on physical grounds. 

If i/f is of appreciable magnitude only in coordinate ranges Ax, Ay, A z, then <f 
is sensibly different from zero only in k x lies in a range A k x , k y in a range A k y , and 
k z in a range A k x , such that 

Ax A k x > 1, Ay A k y > 1, Az A k z > 1 (2.19) 

These inequalities are generalizations of the one-dimensional case. By (2.2), Ap = 
ftAk; hence, in three dimensions the uncertainty relations are 


Ax A p x > h. Ay A p y > h, A z A p z 2: h 


( 2 . 20 ) 


Let us consider a wave packet whose Fourier inverse 4> (k) is appreciably differenl 
from zero only in a limited range Ak near the mean wave vector hk. In coordinate 
space, the wave packet if/{ r, t) must move approximately like a classical free particle 
with mean momentum hk. To see this behavior we expand <y(k) about k: 

<a(k) = «(k) + (k - k) • (V k w) k= k + • • * (2 .2i) 

= To + (k — k) • V k o> + • • • 


with obvious abbreviations. 

If we substitute the first two terms of this expansion into (2.18), we obtain 

i//(r, t) = exp (— ilo t + /k • V k w t)if/(r — V k c5 t, 0) (2.22) 


Ignoring the phase factor in front, we see that this describes a wave packet perform- 
ing a uniform translational motion, without any change of shape and with the group 
velocity 

\ g = V k (M (2.23) 

Our interpretation, based on the correspondence between classical and quantum me- 
chanics, leads us to identify the group velocity of the if/ wave with the mean particle 
velocity hk/m, where m is the mass of the particle and nonrelativistic particle motion 


5 d 3 k = dkjc dk y dk z denotes the volume element in k-space, and d 3 r = dx dy dz is the volume 
element in (position) coordinate space. 
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has been assumed. Since this relation must hold for an arbitrary choice of k, we may 
omit the averaging bars and require 

mV k to — hk (2.24) 

Integrating this, we obtain 

ft 

to(k) = — (k 2 + ky + k\) + const. (2.25) 

2m 


or 


h(o( k) 


2m 


p 

+ V = — + V 
2m 


(2.26) 


where V is a constant, which may be thought of as an arbitrary constant potential 
energy. We may assume that hat represents the particle energy, 

E = ha (2.27) 


A similar equation has already been found to hold for photons. This is not as sur- 
prising as it may seem, for (2.2) and (2.27), although obtained in this chapter for 
nonrelativistic particle mechanics, are Lorentz-invariant and thus remain valid in the 
relativistic case. Indeed, they merely express the proportionality of two four-vectors: 



(2.28) 


Hence their great universality. 


4. The Uncertainty Relations and the Spreading of Wave Packets. We must 
examine the conditions under which it is legitimate to neglect the quadratic and 
higher terms in (2.21). Without these corrections the wave packet moves uniformly 
without change of shape, and we should ask what happens in time when the neglect 
of the quadratic terms is no longer justified. 

For the nonrelativistic case cc^idereddiera^the^xpansion (2.21) actually ter- 
minates because (with V = 0) 


hk 1 hk? hk — h — 2 

w(k) = — = — + — • (k - k) + — (k - k) 2 

2m 2m m 2m 


(2.29) 


Thus, if quadratic terms are retained, the behavior of the wave packet is given by 
an exact formula. 

For simplicity, let us consider a one-dimensional wave packet for which 
$(k) = 0 unless k = kx (k y = k z = 0), in which case the neglected term in (2.21), 
multiplied by t, is 


h 


h 


— (K ~ k x ) t = — (A k x ) t = 


2m 


2m 


(A p x f 
2 mh 


(2.30) 


This term generally contributes to the exponent in the integrand of (2.18). An ex- 
ponent can be neglected only if it is much less than unity in absolute value. Thus, 
for nonrelativistic particles, the wave packet moves without appreciable change of 
shape for times t such that 


t\ « 


mh 
(A p x f 
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or 

. , . A p, , , h 

\t\ Av x = — \t\ « - — = Ax (2.31) 

m Ap x 

The product |f| Av x represents an uncertainty in the position at time t, over and 
beyond the initial uncertainty Ax, and is attributable to the spread in initial velocity. 
If | f | grows too large, condition (2.31) is violated, the second-order contribution to 
the phase can no longer be neglected, and the wave packet is broadened so that for 
very large |f[, 

Ax(t)Ak x > 1 (2.32), 

This means that as t increases from the distant past (t — » — <») to the remote future 
it — » +o°), the wave packet first contracts and eventually spreads. 

Exercise 2.5. Consider a wave packet satisfying the relation Ax A p x ~ h. 
Show that if the packet is not to spread appreciably while it passes through a fixed 
position, the condition A p x « p x must hold. 

Exercise 2.6. Can the atoms in liquid helium at 4 K (interatomic distance 
about 0.1 nanometer = 1A) be adequately represented by nonspreading wave pack- 
ets, so that their motion can be described classically? 

Exercise 2.7. Make an estimate of the lower bound for the distance Ax, within 
which an object of mass m can be localized for as long as the universe has existed 
(“lO 10 years). Compute and compare the values of this bound for an electron, a 
proton, a one-gram object, and the entire universe. 


The uncertainty relation (2.15) has a companion that relates uncertainties in 
time and energy. The kinetic energy is E — p 2 x Hm, and it is uncertain by an amount 

A £ = - A Px = v x A p x (2.33) 

m 


If a monitor is located at a fixed position, he or she “sees” the wave packet sweeping 
through his or her location. The determination of the time at which the particle passes 
the monitor must then be uncertain by an amount: 

. Ax h h 

At = — > = — 

v x v x A p x A E 


Hence, 


AE At > h (2.34) 

In this derivation, it was assumed that the wave packet does not spread appreciably 
in time At while it passes through the observer’s position. According to (2.31), this 
is assured if 


h 

— < At « 
AE 


mh 


With (2.33) this condition is equivalent to AE « E. But the latter condition must 
be satisfied if we are to be allowed to speak of the energy of the particles in a beam 
at all (rather than a distribution of energies). 
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The derivation of the time-energy uncertainty relation given here is narrow and 
recise. More generally, a similar inequality relates the indeterminancy A t in the 
me of occurrence of an event at the quantum level (e.g., the decay of a nucleus or 
article) to the spread A E in the energy associated with the process, but considerable 
are is required to establish valid quantitative statements (see Chapter 19). 

Exercise 2.8. Assume that in Figures 2.1 and 2.2 the units for the abscissa are 
l 1 (for k x ) and A (for jc), respectively. If the particles described by the wave packets 
l these figures are neutrons, compute their mean velocity (in m/sec), the mean 
inetic energy (in meV), the corresponding temperature (in Kelvin), and the energy 
pread (also in meV). Estimate the time scale for the spatial spread of this wave 
acket. 

. The Wave Equation for Free Particle Motion. Although the plane wave (Fou- 
ier) representation (2.18) gives the most general form of a free particle wave func- 
on, other representations are often useful. These are most conveniently obtained 
s solutions of a wave equation for this motion. We must find a linear partial dif- 
^rential equation that admits (2. 1 8) as its general solution, provided that the relation 
etwee n a and k is given by the dispersion formula (2.26). To accomplish this, we 
eed only establish the wave equation for the plane waves (2.7), since the linearity 
f the wave equation ensures that the superposition (2.18) satisfies the same equa- 
on. 

The wave equation for the plane waves <? , ( kr - Q,f > 5 with 

hcoik) = ~ (k 2 x + k 2 y + k 2 z ) + V 
2m 

nth constant V, is obviously 

ih d J^Jl = -L. V 2 ^(r, t) + ViK r, t ) (2.35) 

dt 2m 

& 

'his quantum mechanical wave equation is also known as the time-dependent Schw- 
inger equation for the free particle. As expected, it is a first-order differential 
quation in t, requiring knowledge of the initial wave function i//(r, 0), but not its 
erivative, for its solution. That the solutions must, in general, be complex-valued 
i again manifest from the appearance of i in the differential equation. For V = 0, 
tie quantum mechanical wave equation and the diffusion (or heat flow) equation 
ecome formally identical, but the presence of i ensures that the quantum mechanical 
/ave equation has solutions with wave character. 

The details of solving Eq. (2.35), using various different coordinate systems for 
lie spatial variables, will be taken up in later chapters. Here we merely draw atten- 
ion to an interesting alternative form of the wave equation, obtained by the trans- 
ormation 


iA(r, t) = e iS(x ’ m (2.36) 

f this expression for the wave function is substituted in Eq. (2.35), the equation 


dS(r, t ) + [VS(r, t )] 2 


dt 


2m 


V 2 S(r, t) + V\e 
2m 


iS(r,t)/h 


= 0 


(2.37) 



5 The Wave Equation for Free Particle Motion 


23 


is derived. The function S(r, t ) is generally complex, but in special cases it may be 
real, as for instance when if/{r, t ) represents a plane wave, in which case we have 

S(r, t) = hkr - hot = pr - Et (2.38) 


Equation (2.37) informs us that wherever and whenever the wave function t/t(r, t) 
does not vanish, the nonlinear differential equation 


S. 


dS{ r, Q 
dt 


[VS(r, t)f ih 

_L_^__ V a , (r , 0+v=0 


(2.39) 


must hold. If the term in this equation containing i and h, the hallmarks of quantum 
mechanics, were negligible, (2.39) would be the Hamilton-Jacobi equation for the 
Hamilton Principal Function of classical mechanics. 6 

To further explore the connection between classical and quantum mechanics for 
a free particle, we again consider the motion of a wave packet. Subject to the re- 
strictions imposed by the uncertainty relations (2.19), we assume that both if/ and (f> 
are fairly localized functions in coordinate and k-space, respectively. In one dimen- 
sion, Figures 2.1 and 2.2 illustrate such a state. The most popular prototype, how- 
ever, is a state whose if/ wave function at the initial time t = 0 is a plane wave of 
mean momentum hk, modulated by a real-valued Gaussian function of width Ax and 
centered at x 0 : 

if/(x, 0) = C ex p|~ ""^j + (2.40) 

(The dynamics of this particular wave packet is the content of Problem 1 at the end 
of this chapter.) 

Somewhat more generally, we consider an initial one-dimensional wave packet 
that can be written in the form 

if/(x, 0) = e lS(x,0)/h = exp[-*(x) + ik x x] (2.41) 


where ^(x) is a smooth real-valued function that has a minimum at x 0 and behaves 
as °°) — > +oo at large distances. In the approximation that underlies (2.22), for 
times short enough to permit neglect of the changing shape of the wave packet, the 
time development of if/ (x, t) = e‘ S(x ' t)lh is given by 


S(x, t) 
h 


_ -do) 

~ — o)t + k x ~=~ t + 
dk x 



do 

dk x 


+ i* x - 



(2.42) 


If the localized wave packet is broad enough so that ^-(x) changes smoothly and 
slowly over the distance of a mean de Broglie wavelength, we obtain from (2.42) 
for the partial derivatives of S(x, t), to good approximation: 


and 


dS(x, t) 
dt 


~ —ho = —E 


(2.43) 


dS(x, t ) 
dx 



(2.44) 


6 To review the elements of classical Hamiltonian mechanics, see Goldstein (1980). 
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These equations hold for values of x near the minimum of hence near the peak 
|^|. From them we deduce that in this approximation S(x, t) satisfies the classical 
Hamilton-Jacobi equation 


dS(x, t) 1 

dt 2m 


dS(x, t) 
dx 


2 

+ V = 0 


(2.45) 


This shows that for a broad class of wave packets a semiclassical approximation for 
the wave function may be used by identifying S(x, t ) with the Hamilton Principal 
Function S(x, t) which corresponds to classical motion of the wave packet. 

The present chapter shows that our ideas about wave packets can be made quan- 
titative and consistent with the laws of classical mechanics when the motion of free 
particles is considered. We must now turn to an examination of the influence of 
forces and interactions on particle motion and wave propagation. 


Problems 


1. A one-dimensional initial wave packet with a mean wave number k x and a Gaussian 
amplitude is given by 


tft(x, 0) = C exp 


4(Ax) 2 


+ ik x x 


Calculate the corresponding ^-distribution and ifj(x, t), assuming free particle motion. 
Plot 1 i l/(x, t) | 2 as a function of x for several values of t, choosing Ax small enough to 
show that the wave packet spreads in time, while it advances according to the classical 
laws. Apply the results to calculate the effect of spreading in some typical micro- 
scopic and macroscopic experiments. 

2. Express the spreading Gaussian wave function i p(x, t ) obtained in Problem 1 in the 
form ip{x, t) = exp [7 S(x, t)/h]. Identify the function Six, t) and show that it satisfies 
the quantum mechanical Hamilton-Jacobi equation. 

3. Consider a wave function that initially is the superposition of two well-separated 
narrow wave packets: 


<A i(x, 0) + i ft 2 (x, 0) 


chosen so that the absolute value of the overlap integral 



M*) dx 


is very small. As time evolves, the wave packets move and spread. Will | y(t) | increase 
in time, as the wave packets overlap? Justify your answer. 

4 . A high-resolution neutron interferometer narrows the energy spread of thermal neu- 
trons of 20 meV kinetic energy to a wavelength dispersion level of AA/X = 10~ 9 . 
Estimate the length of the wave packets in the direction of motion. Over what length 
of time will the wave packets spread appreciably? 
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The Schrodinger Equation, the Wave 
Function, and Operator Algebra 


In this chapter we study the properties and behavior of the Schrodinger 
wave function as a function of space and time, identifying it as a 
probability amplitude. As the main tool for extracting information from ■- 
the wave function, Hermitian operators and their algebraic properties are 
characterized. The advanced reader can regard this chapter and the next as 
a refresher of the concepts and principles of wave mechanics. 1 


1. The Wave Equation and the Interpretation of tf/. A straightforward gen- 
eralization of the free particle wave equation (2.35) to the case of motion of a 
particle of mass m in the field of force represented by a potential energy function 
V(x, y, z, t), which is dependent on the position r and possibly also on the time t, is 
the equation 


ih 


dtp( r, t) 
dt 


2m 


V 2 i/K r, t) + V(x, y, z, t)if/(r, t) 


(3.1) 


Schrodinger advanced this equation on the grounds that the same reasoning that led 
from (2.35) to (2.39) for a constant potential — setting if/ = e lS,fl — now implies the 
equation 



[VS] 2 

2m 


ih 

2m 


V 2 S + V(x, y, z,t) = 0 


(3.2) 


This looks like the Hamilton-Jacobi equation of classical mechanics in the presence 
of forces, supplemented again by a quantum mechanical term proportional to h. The 
existence of the factor i in this equation is of crucial importance and generally 
requires that the function S be complex- valued. More will be said in Chapter 15 
about the relation between wave mechanics and classical Hamiltonian mechanics. 

We will adopt (3.1) as the fundamental equation of nonrelativistic quantum 
mechanics for particles without spin and call it the wave equation or the time- 
dependent Schrodinger equation. In subsequent chapters, this theory will be gen- 
eralized to encompass many different and more complicated systems, but wave 
mechanics, which is the branch of quantum mechanics with a dynamical law in the 
explicit form (3.1), remains one of the simplest paradigms of the general theory. 

The ultimate justification for choosing (3.1) must, of course, come from agree- 
ment between predictions and experiment. Hence, we must examine the properties 
of this equation. Before going into mathematical details, it would seem wise, how- 
ever, to attempt to say precisely what i jj is. We are in the paradoxical situation of 
having obtained an equation that we believe is satisfied by this quantity, but of 


*An excellent modern textbook on wave mechanics is Bialynicki-Birula, Cieplak, and Kaminski 
(1992). It includes an extensive bibliography with many references to the original quantum mechanics 
literature. 
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having so far given only a deliberately vague interpretation of its physical signifi- 
cance. We have regarded the wave function as a “measure of the probability” of 
finding the particle at time t at the position r. How can this statement be made 
precise? 

Obviously, the quantity if/ itself cannot be a probability. All hopes we might 
have entertained in that direction vanished when ^became a complex function, since 
probabilities are real and positive. In the face of this dilemma, as was already ex- 
plained in Chapter 1, the next best guess is that the probability is proportional to 
1 if/\ 2 , the square of the amplitude of the wave function. 

Of course, we were careless when we used the phrase “probability of finding 
the particle at position r.” Actually, all we can speak of is the probability that the 
particle is in a volume element d 3 r which contains the point r. Hence, we now try 
the interpretation that | if/\ 2 is proportional to the probability that upon measurement 
of its position the particle will be found in the given volume element. The probability 
of finding the particle in some finite region of space is then proportional to the 
integral of if/* if/ over this region. 

The consistency of this probabilistic interpretation of the wave function requires 
that if the probability of finding the particle in some bounded region of space de- 
creases as time goes on, then the probability of finding it outside of this region must 
increase by the same amount. The probability interpretation of the if/ waves can be 
made consistently only if this conservation of probability is guaranteed. This 
requirement is fulfilled, owing to Gauss’ integral theorem, if it is possible to define 
a probability current density j, which together with the probability density p = if/* if/ 
satisfies a continuity equation 

^ + V • j = 0 (3.3) 

at 

exactly as in the case of the conservation of matter in hydrodynamics or the con- 
servation of charge in electrodynamics. 

A relation of the form (3.3) can easily be deduced from the wave equation for 
real-valued V. Multiply (3.1) on the left by if/*, and the complex conjugate of (3.1) 
by if/ on the right; subtract the two equations and make a simple transformation. The 
resulting equation 

d(if/*if/) h 

dt + W V ' [ ^* V ^ "" = 0 (3-4) 

has the form of (3.3) if the identification 

p = Cif/*if/ (3.5) 

j = C^[iA*V<A- (Vtf/*)if/] (3.6) 

2 mi 

is made. Here C is a constant. 

Exercise 3.1. Derive (3.4). Note that it depends on V being real. 

Exercise 3.2. Generalizing (3.4), prove that, again for real V, 
d(d/,*if / 2 ) h 

~ J + T~. v • - (Vfc*)fc] - 0 (3.7) 

ot 2 mi 

if if/\ and if / 2 are two solutions of (3.1). 



1 The Wave Equation and the Interpretation of ip 


27 


We have emphasized that V must be real-valued if conservation of probability 
is to hold. Complex-valued potentials are used to simulate the interaction of a par- 
ticle (e.g., a nucleon) with a complicated many-body system (nucleus), without hav- 
ing to give a detailed account of the dynamic process. The imaginary part of such 
an optical potential serves as a shorthand description of the absorption of the particle 
by the many-body system, but in principle the particle’s motion can be tracked even 
after it is “absorbed.” The true creation or annihilation of a particle, as in nuclear 
beta decay, requires generalization of the quantum mechanical formalism (see Chap- 
ter 21). 

Exercise 3.3. Show that if S(r, t) is defined by ip = e lS,h as in (2.36), the 
probability current density can be expressed as 

j - p V(Re S)/m (3.8) 


Exercise 3.4. Assume that the function </>(k) in expression (2.18) is real and 
symmetric with respect to k. Show that 

<K r, 0) = ^>o(r) 

where ip 0 (r) is a real amplitude modulating the plane wave. Calculate p and j for a 
wave packet of this form and show that for free particle motion in the approximation 
(2.22) the relation 

• P 

j = - p = vp 
m 

is obtained. Is this a reasonable result? 


So far, we have only assumed that p and j are proportional to ip and ip*. Since 
(3.1) determines tp only to within a multiplicative constant, we may set C = 1. 


p = i p*ip 


(3.5a) 


. ^ _ 

J = z : [ip*V*P 
2 mi 


(Vi p*)ip] 


(3.6a) 


and normalize the wave function by requiring 


f p d 3 r = 

I ip*ip d 3 r = 1 

Jail space 

Jail space 


(3.9) 


provided that the integral of ip*ip over all space exists (or is, in mathematical ter- 
minology, an element of the space L 2 ). From here on, all spatial integrations will be 
understood to extend over all space, unless otherwise stated, and the limits of in- 
tegration will usually be omitted. Equation (3.9) expresses the simple fact that the 
probability of finding the particle anywhere in space at all is unity. Whenever this 
integral exists, we will assume that this normalization has been imposed. Functions 

i p for which f ip*ip d 3 r exists are often called quadratically integrable. 
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If the integral (3.9) exists, and if the probability current density falls to zero 
faster than r~ 2 as r becomes very large, then Gauss’ theorem applied to (3.3) gives 


d_ 

dt 


- 


p d 3 r 


0 


(3.10) 


which simply tells us that the wave equation guarantees the conservation of nor- 
malization: If ijj was normalized at t = 0, it will remain normalized at all times. 

In light of the foregoing remarks, it is well to emphasize that throughout this 
book all integrals that are written down will be assumed to exist. In other words, 
whatever conditions i ft must satisfy for a certain integral to exist will be assumed to 
be met by the particular wave function used. This understanding will save us labo- 
rious repetition. Examples will occur presently. 

When the integral (3.9) does not exist, we may use only (3.5) and (3.6), and 
we must then speak of relative rather than absolute probabilities. Wave functions 
that are not normalizable to unity in the sense of (3.9) are as important as normal- 
izable ones. The former appear when the particle is unconfined, and in principle they 
can always be avoided by the use of finite wave packets. However, rigid insistence 
an dealing only with finite wave packets would prevent us from using such simple 
wave functions as the infinite plane wave e !kr , which describes a particle that is 
squally likely to be found anywhere in space at all — hence, of course, with zero 
absolute probability in any finite volume. Owing to the simple properties of Fourier 
integrals, mathematically this patently unphysical object is extremely convenient. It 
is an idealization of a finite, quadratically integrable, but very broad wave packet 
that contains very many wavelengths. If these limitations are recognized, the infinite 
plane matter wave is no more objectionable than an infinite electromagnetic wave 
that is also unphysical, because it represents an infinite amount of energy. We shall 
depend on the use of such wave functions. Alternative methods for normalizing them 
are discussed in Sections 4.3 and 4.4. 

One further comment on terminology: Since i//(r, t ) is assumed to contain all 
information about the state of the physical system at time t, the terms wave function 
ijj, and state may be used interchangeably. Thus, we may" speak without danger of 
confusion of a normalized state if (3.9) holds. 

It is tempting to use the definition (3.5a) of the probability density and the 
definition of the function S to express the wave function as 

tfj = e ~ lmS/h e iRcS/h = Vp e‘ RaS/h (3.11) 


The coupled equations of motion for p and the phase function R eS are easily obtained 
as the real and imaginary parts of (3.2). It follows from (3.8) that one of these is 
simply the continuity equation 


dp „ 
dt 


V(Re S ) 


m 


= 0 


(3.12) 


This equation can be interpreted as stating that probability is similar to a fluid that 
flows with velocity v = V(Re S)/m. This irrotational flow (V X v = 0) is known as 
Madelung flow. 

The second equation of motion is 


d(Re S) | [V(Rc 5)] 2 f 

Am 


" (Vp ) 2 

. 2 P 2 



- 0 


dt 


2m 


P 


(3.13) 
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or 


9(ReS) + {V(Re S)} 2 + y ft 2 V 2 Vp 
dt 2m 2m Vp 


(3.14) 


Since everything in Eqs. (3.12) and (3.13) is real- valued, it is not unreasonable to 
think of Eq. (3.13) or (3.14) as a classical Hamilton- Jacobi equation for the dynamics 
of a particle, provided that the last term is regarded as a quantum potential that must 
be added to the bonventional potential energy V. Taking his cue from these nonlinear 
equations, which are so classical in appearance, David Bohm and others advocated 
a more realistic and deterministic, or ontological, interpretation of quantum me- 
chanics than the one adopted in the present text. Whatever conceptual appeal Eqs. 
(3.12) and (3.13) may have is offset by the abstruseness of the quantum potentials 
and the complexities of the Bohm approach for many-particle and relativistic 
systems. 2 

Since the interpretation of the wave function is in terms of probabilities, the 
elements of probability theory are reviewed in Appendix A, Section 2. There it is 
assumed that the events are discrete, whereas in the application to quantum me- 
chanics continuous probability distributions such as p = 1 1 (f(x, y, z, t ) | 2 are common, 
as we will see in Section 3.2. The summations in the formulas for expectation (or 
average or mean) values must then be replaced by integrations. In quantum me- 
chanics the term expectation value is preferred when it is desirable to emphasize the 
predictive nature of the theory and the fact that the behavior of a single particle or 
physical system is involved. Although the wave function must be regarded as de- 
scribing an ensemble representing a single system, the predictions for the state can 
be compared with experimental trials only by the use of a large number of identical 
and identically prepared systems. The situation is analogous to the one encountered 
customarily in the use of probability concepts. For instance, the probability of toss- 
ing heads with a slightly unsymmetrical coin might be 0.48 and that of tossing tails 
0.52. These probabilities refer to the particular coin and to the individual tossings, 
but the experimental verification of the statement “the probability for tossing heads 
is 48 percent’’ requires that we toss many identical coins, or the same coin many 
times in succession, and 48 percent of a very large sequence of trials are expected 
to yield “heads.” 


2. Probabilities in Coordinate and Momentum Space . For a state normalized 
according to condition (3.9), the average or expectation value of the coordinate x, 
which is a random variable, is 

(x) — J x | i/t| 2 d 3 r (3.15) 


The expectation value of the position vector r, or the center of the wave packet, is 
defined by 



(3.16) 


2 For a fuller and more sympathetic assessment of David Bohm’s approach to quantum mechanics, 

see Cushing (1994). 
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An arbitrary function of r has the expectation value 

</(r)> = | /( r) \if/\ 2 d 3 r = j ip*f(r)ipd 3 r (3.17) 


The reason for writing (f) in the clumsy form 



d 3 r will become clear when 


less elementary expectation values make their appearance. 

At this point, it is not obvious how to set up the calculation of expectation 
values of momentum or kinetic energy or any other physical quantity that is not 
expressed simply in terms of the position coordinates. Since we are assuming that 
the physical state of a particle at time t is described as fully as possible by the 
normalized wave function \p{r, t), the discussion in Chapter 2 suggests that knowl- 
edge about the momentum of the particle is contained in the Fourier transform 
(pi p, t), which is related to ipir, 0 by the reciprocal Fourier transforms (see Appen- 
dix, Section 1): 


and 


*A(r, t ) 


1 

(2t rh) 3/2 


<P (p, 0 exp 



P 



(3.18) 


<P( p, 0 = 


1 

(27 Thf' 2 



i 

-i v r 



(3.19) 


In these equations we have used p rather than k = p Ih as the conjugate Fourier 
variable, but the analogous relations (2.16), (2.17), and (2,18) in k-space are recov- 
ered by the substitutions 

p — »• hk, d 3 p — »■ h 3 d 3 k, <pip) h~ 3/2 <p ik) 


Rather than introducing two different symbols for the functions describing the state 
in momentum space (p) and in k-space, we rely on the explicit notation (pip) and 
<f> (k) to remind us of the difference between them. 

We emphasize once more that the functions ip in coordinate space (or position 
space) and (p in momentum space are both equally valid descriptions of the state of 
the system. Given either one of them, the other can be calculated from (3.18) or 
(3.19). Both if/ and cp depend on the time t, which is a parameter. For the special 
case of a free particle we know from Chapter 2 how a plane wave of momentum 
p = hk develops in time and that 

<p(p, t) = f(p, Ofe-^ (3.20) 


[In Eqs. (2.16), (2.17), and (2.18), 0(k, 0) was denoted simply as <£(k).] 

For a general motion in a potential, the time development of (p (p, t ) is more 
complicated than (3.20) and must be worked out from the wave equation or its 
partner in momentum space. The latter equation is obtained from the wave equation 
in coordinate space (3.1), most easily by differentiating (3.19) with respect to t : 


ih 


d<P(P, t ) 

dt 


(27rh) 


3/2 


J £ 


., dipir, t) 

in exp 

dt 



or 


ih 


d(p( p, t ) 

dt 


(Inh) 


3/2 


M-i- 


h 2 

- V 2 + y (r) 

2m 


ip(r, t) d 3 r 
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The first term of this last expression can be transformed by integrating by parts 
twice and assuming that is subject to boundary conditions, vanishing sufficiently 
fast at large distances, so that the surface terms can be neglected. If the Fourier 
transforms (3.18) and (3.19) are then used again, after a little rearranging we obtain 


m = f *(„, o 

dt 2m 7 


+ 


1 


(2 rrh) 


3/2 


I exp^- 


^ p • rjv(r) expf p' • r U(p\ t)d 3 r d 3 p' 


(3.21) 


This mixed integral-differential equation for is the wave equation in momentum 
space, and it is fully equivalent to (3.1). It can be transformed into a differential 
equation if V is an analytic function of x, y, z, and if we can write 


exp I — - p • r jV(r) = V(ihV p ) expi 



(3.22) 


Substituting this into (3.21) and removing the differential operator with respect to 
p from the integrals, we can now perform the r-integration, and we are left with a 
delta function. This in turn makes the p-integration a trivial operation, and the result 
is a partial differential equation in momentum space: 


„ debit), t) p 2 

ih — — = — 0(p, 0 + VO'ftV p )0(p, t) 


(3.23) 


Exercise 3.5. Work through the steps leading to (3.23), using the Appendix, 
Section 1. 

If we multiply (3.21) by fi*, and the complex conjugate of (3.21) by 4>, subtract 
the two equations, and integrate the resulting equation over the entire momentum 
space (p), we obtain, after renaming some variables of the integration, 

S f 

— J <£*(P, t)<t>( p, t)d 3 p = 0 (3.24) 

This result should not be surprising because it follows directly from the Fourier 
transforms (3.18) and (3.19) that 

J l<MP> 0| 2 d 3 p = J | iA(r, t)\ 2 d 3 r (3.25) 

This equation implies that if i jj is normalized to unity for any t, (j) is automatically 
also so normalized. Equation (3.10) or (3.24) then guarantees that this normalization 
is preserved at all times. 

Since ex P^r P • represents a state with definite sharp momentum p, cf>( p) 

according to (3.18) is the amplitude with which the momentum p is represented in 
the wave function If is strongly peaked near a particular value of p, the state 
tends to be one of rather definite momentum and similar to a plane wave. This 
observation, in conjunction with the conservation property (3.24), leads us to the 
assumption that |$(p, t)| 2 d 3 p is the probability of finding the momentum of the 
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particle in the volume element d 3 p in the neighborhood of p at time t, and that 
| 4>{ p, t) | 2 is the probability density in momentum space. 

The symmetry of r and p which the present section exhibits is also evident in 
classical mechanics, especially in the Hamiltonian form, where (generalized) coor- 
dinates q and momenta p stand in a similar reciprocal relationship. However, in 
classical analytical dynamics the symmetry is primarily a formal one, for q and p 
together determine the state at time t, and classical physics tacitly assumes that they 
can both be determined simultaneously and independently with perfect precision. In 
quantum mechanics, the statistical properties of these two physical quantities are 
determined by the functions ip and (p , respectively. Since these functions are one 
another’s Fourier transforms, q and p are correlated and can no longer be measured 
or chosen entirely independently. In general, the covariance (Appendix, Section 2) 
of the random variables representing position and momentum does not vanish. The 
probability distributions [ i/>(r, r)| 2 and |<£(p, t)\ 2 are subject to the Heisenberg un- 
certainty relation, a precise statement and mathematical proof of which will be given 
in Section 10.5. 

Wave mechanics in coordinate space (ip) and wave mechanics in momentum 
space ((f)) are thus seen to be two equivalent descriptions of the same thing. As 
Chapters 9 and 1 0 will show, it is advantageous to regard both of these descriptions 
as two special representations of a general and abstract formulation of quantum 
mechanics. The terms coordinate representation and momentum representation will 
then be seen to be apt for referring to these two forms of the theory, but among the 
infinitely many possible representations no special status will be accorded them. 
Although most of the equations in wave mechanics will be written explicitly in the 
coordinate representation, any equation can be transcribed into the momentum rep- 
resentation at any desired state of a calculation. 

The probability interpretation of the momentum wave function <j> allows us to 
construct the expectation value of the momentum for any state: 


(p) = j p|<£(P. Of 2 d 3 p 


(3.26) 


In order to derive an expression for this expectation value in terms of ip, we substitute 
the complex conjugate of (3.19) into (3.26) and write 




<A*( r, t) exp^— p • rj p 0(p, t) d 3 r d 3 p 

= J J <A*(r, OyV exp^ p • r^<£(p, t) d 3 r d 3 p 

After liberal interchange of differentiations and integrations, and use of (3.18), the 
desired result 


/* 'h' 

<P> = J <A*(r, t) - Vip(r, t) d 3 r 


(3.27) 


is obtained. More generally, the same technique gives for any (analytic) function of 
momentum, g(p), 

V^jif/(r, t) d 3 r (3.28) 


<#(P)> = £(P) |0(P, Ol 2 d 3 p = if/*(r, t)g 

J J 





2 Probabilities in Coordinate and Momentum Space 


33 


where products of V’s or d/dx’s are to be understood as successive operations of 
differentiation of the function on the right, on which they act. Conversely, the ex- 
pectation value of any (analytic) function /( r) of position is given by 

</(r)> = | /( r) | tff(r, t)\ 2 d 3 r = J <p*(p, t)f(ihV p )<f)(p, t) d 3 p (3.29) 

If the physical system under consideration can be described in the language of 
classical mechanics, all quantities associated with it, such as kinetic and potential 
energy, angular momentum, and the virial, can be expressed in terms of coordinates 
and momenta. The question that then arises is how to compute the expectation value 
of an arbitrary function F(r, p). The expression for this expectation value can be 
derived to a certain extent from our previous work. 

For example, if we wish to calculate the expectation value of a mixed function 
like f(x, p y , z ), we carry out the Fourier analysis only part of the way and define a 
mixed coordinate-momentum wave function by the integral 

X(x, p y , z) = (2 ; i72 J if(r, t) exp^-y p y y^dy (3.30) 

The square of the absolute value of this function, |^(x, p y , z)\ 2 , can easily be seen 
to be the probability density of finding the particle to have coordinates x and z, but 
indeterminate y , whereas the y-component of its momentum is equal to p y . From 
this, it follows readily that the expectation value of any function of x, p y , z is 

(f(x, p y , z)) = f fix, Py, z) \x(x, Py, z) | 2 dx dp y dz 


Similar considerations hold for other combinations of components of r and p, pro- 
vided that the simultaneous specification of conjugate variables, such as x and p x , 
is avoided. As a result, we may conclude that 


(F(r, p)> - 


J t//*(r, t)F^r, y V^(r, t) d 3 r 


t)F(ihV p , p)<£(P, 0 d 3 p 



(3.31) 


However, we have not proved (3.31) generally, because no account has been taken 
of as simple a function of coordinates and momenta as F = r • p = xp x + yp y + zp z 
(which will be encountered in Section 3.6). The trouble here is that our analysis can 
never give us a wave function the square of whose absolute value would represent 
the probability density of finding the particle with coordinate x and momentum 
component p x . Heisenberg’s uncertainty principle is an articulation of this inability. 
If products like xp x do appear, we postulate that (3.31) holds, but we must keep in 
mind that classically the product xp x can be equally well written as p x x, whereas the 
quantum mechanical expectation values of xp x and p x x are generally different (and, 
in fact, not even real- valued). 

The equality of the two expressions on the right side of Eq. (3.31) for any pair 
of coordinate and momentum wave functions is equivalent to the following theorem: 

If if(r) and <£(p) are Fourier transforms of each other in the sense of (3.18) and 

/ h \ 

(3.19), then for any function F{ r, p), the coordinate function F r, -7 V ]tf/(r) and 
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the momentum function F(ihV p , p)</>(p) are Fourier transforms of each other in the 
same sense, that is, 


f(t, * v'j «K r) = I F(ihV p , p)0 (p) d 3 p (3.32) 


and 


mv p , p)0(p) = (277 -fe)3/2 / e ,Mpr 7 v j*A( r ) d * r 


(3.33) 


These formulas hold if ip and (p go to zero sufficiently fast for large values of their 
arguments, so that surface terms, resulting from integration by parts, vanish. 

/ h \ 

When Fl r, — V I acts on ip(r), a new function of the coordinates results, and 

similarly F(ihV p , p) turns (p(p) into a new function of momentum. Thus, F is an 
example of an operator that maps one function into another. 


Exercise 3.6. Derive the equality of the two expressions on the right side of 
(3.31) from the transforms (3.32) and (3.33). 


Exercise 3.7. Prove that if i/q (r) and i/z 2 ( r ) are Fourier transforms of 4>\(p) 
and </> 2 ( p), respectively, we have the identity 

J </'f(r)tp 2 (r)d 3 r = J <pf(p)<p 2 (p) d 3 p (3.34) 

which is a generalization of (3.25). 


Exercise 3.8. Show that for a linear harmonic oscillator with V = moo 2 ; c 2 /2 
the wave equations in coordinate and momentum space have the same structure and 
that every normalized solution i pi(x, t) of the wave equation in coordinate space can 
be related to a solution 


<t> 2 (Px,t) 



(3.35) 


of the wave equation in momentum space. [The different subscripts are meant as 
reminders that, generally, i/q and <£ 2 in (3.35) are not Fourier transforms of each 
other and do not represent the same state.] 


3. Operators and Expectation Values of Dynamical Variables. An operator is a 
rule that maps every function i jj (or (f > ) into a function Ftp (or F(p). An operator is 
said to be linear if its action on any two functions t/q and ip 2 is such that 

F( A«/q + pnp 2 ) = AFipi + pFip 2 (3.36) 

where A and p are arbitrary complex numbers. Derivatives are obviously linear 
operators, as are mere multipliers. Most of the operators that are relevant to quantum 
mechanics are linear. Unless it is specifically stated that a given operator is not 
linear, the term operator will henceforth be reserved for linear operators. 

The only other category of operators important in quantum mechanics is the 
antilinear variety, characterized by the property 

F( Ai/q + pip 2 ) — A*Fi/q + p*Fip 2 


(3.37) 
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Complex conjugation itself is an example of an antilinear operator. Antilinear op- 
erators are important in quantum mechanics when time-reversed physical processes 
are considered. 

Exercise 3.9. Construct some examples of linear and antilinear operators and 
of some operators that are neither linear nor antilinear. 

i 

From Eq. (3.31) we see that with any physical quantity, which is a function of 
coordinates and linear momenta, there is associated a linear operator which, when 
interposed between the wave function and its complex conjugate, upon integration 
gives the expectation value of that physical quantity. The explicit form of the op- 
erator evidently depends on whether the i/s or </> function is used to represent the 
state. The expectation value of the physical quantity F is given by the formula 

(3.38) 

The student of quantum mechanics must get used to the potentially confusing prac- 
tice of frequently denoting the mathematical operators, even in different represen- 
tations, by the same symbols (e.g., F above) as the physical quantities that they 
represent. Yet, as in most matters of notation, it is not difficult to adapt to these 
conventions; the context will always establish the meaning of the symbols unam- 
biguously. 


Table 3.1 presents some of the most important physical quantities or dynamical 

variables, and the operators that represent them. 


Table 3.1 




Physical Quantity 


Operator in Coordinate 
Representation 

Operator in Momentum 
Representation 

Position 

r 

r 

ihV p 

Momentum 

P 

— V 
i 

P 

Angular momentum 

L 

h _ 
r X - V 

i 

ihV p X p 

Kinetic energy 

T 

2m 

i_ 

2m 

Potential energy 

V 

V(r) 

V(ihV p ) 

Total energy 

E 

h 2 , 

V 2 + V(r) 

2m 

t + 


As the discussion in the last section makes clear, the expectation values of all 
of these operators are real numbers, as they should be if they are averages of physical 
quantities that are analogous to the familiar classical quantities of the same name. 

The reasonableness of the operator assignments in Table 3.1 is confirmed by 
showing that their expectation values generally satisfy the laws of classical me- 
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chanics. For example, by using the continuity equation, the time derivative of the 
mean x coordinate can be written as 

^ (x) = | x^d 3 r = -J xV • j d 3 r = -J V ■ (x j )d 3 r + j j x d 3 r = j j x d 3 r 


or 



(3.39) 


where the divergence term has been removed under the assumption that t/t vanishes 
sufficiently fast at infinity. Using (3.6a) and integration by parts, we finally obtain 

i /• > 

m — (r) = i/t* - Vi/t d 3 r = (p) (3.40) 

dt J i 


which shows that for the expectation values of wave packets the usual relation be- 
tween velocity and momentum holds. We note here that p is the (kinetic) momentum 
mv, which is the canonical momentum only if there is no electromagnetic vector 
potential, A = 0. The appropriate generalization in the case A + 0 will be dealt 
with in Section 4.6. 

The time rate of change of (p) is, by use of (3.1) and its complex conjugate, 
given by 


!<*>- 


-m f <pr - is f r- 

J dt dx J dx 


.ft 

2m 


(VV*) — - <A*V 2 — 

dx 



- 


dim 

dx 


d 3 r 


The integral containing Laplacians can be transformed into a surface integral by 
Green’s theorem: 3 



It is assumed that the last integral vanishes when taken over a very large surface S. 
What remains is 


I ^ ~ +* Tx * d ’ r 

or, more generally, 

J p 

- (p) = - J ijj*iVV) if, d 3 r = -(VV) = <F> 


(3.41) 


3 Green’s theorem for continuously differentiable functions u and v: 



vVu) ■ dS = 



v - vV 2 u)d 3 r 


where the surface S encloses the volume V. 
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Equation (3.41), known as Ehrenfest’s theorem, looks like Newton’s second law, 
written for expectation values, and conforms to the correspondence principle. If the 
state ip is such that (F(r)> ~ F((r)), it follows from (3.40) and (3.41) that (r) moves 
in time very nearly like a classical particle. In particular, this condition holds for a 
sharply peaked wave packet. 

We can establish a general formula for the time derivative of the expectation 
value ( F ) of any operator F. To this end we write the wave equation (3.1) in the 
form 


* dif/ 

ih — = Hip 
dt 


by making use of the notation 

H = -~v 2 + m 

2m 


(3.42) 


(3.43) 


for the Hamiltonian (or total energy) operator, H. The time derivative of the expec- 
tation value of any operator F, which may be explicitly time dependent, can be 
written as 


m 7 (f) 

at 


= ih f ip* F — d 3 r + ih f — Ftp d 3 r + ih I ip* — ip d 3 
J dt J dt J dt 

= I tp* FHipd 3 r - I (Hip)* Fipd 3 r + ihC~^) 


= j ip* f(^- 


2m 


V 2 + V]ipd 3 r 


- 1 


V 2 + VjFip d 3 r + ih(^^ 


In the last step, use has again been made of Green’s theorem, and vanishing boundary 
surface terms have been omitted. The potential energy is, of course, assumed to be 
real. Compactly, we may write the last result as 


ih 4 (F) = (FH - HF) + th- 
at 



(3.44) 


This formula is of the utmost importance in all facets of quantum mechanics. 


Exercise 3.10. Show that Eq. (3.44) is also true in the momentum represen- 
tation. 


The order in which two operators F and H, both acting to their right, are applied 
to a state ip (or <p) is generally important, and their commutator. 


[F, H] = FH - HF 


(3.45) 


is a measure of the difference between the two products. If 

[F, H] = 0 (3.46) 

the two operators are said to commute. 

According to Eq. (3.44), if F commutes with H and does not explicitly depend 
on t, ( F ) is constant in time for any state. The physical quantity F is then termed a 
constant of the motion and is said to be conserved. 
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Since H commutes with itself, (H) is constant in time, expressing the quantum 
version of the law of conservation of energy for a system with a potential that does 
not explicitly depend on time (conservative forces). 

Linear momentum p = mv is a constant of the motion if p commutes with H. 
Since p commutes with the kinetic energy operator it is only necessary to 

examine whether it also commutes with V(x, y, z). All three components of momen- 
tum commute with V only if V is constant. Hence, in quantum mechanics as in 
classical mechanics, linear momentum is conserved only if no external forces act 
upon the system. 


Exercise 3.11. Prove that the operator p x commutes with a function V(x) of x 
only if V is constant, and generalize this result to three dimensions. 


4. Commutators and Operator Algebra, The commutator [A, B] = AB — BA of 
two operators plays a prominent role in quantum mechanics, as exemplified in the 
last section. The most important commutators are those involving canonically con- 
jugate variables, such as the pairs x and p x , y and p y , z and p z . The manipulation of 
commutation relations for various dynamical variables, as Dirac called the operators 
that represent physical quantities, is most safely accomplished by allowing the op- 
erators to act on an arbitrary function, f, which is removed at the end of the cal- 
culation. Thus, in the coordinate representation, 


(XPx ~ PxX)f = 



h d_ 
i dx 



df 

dx 


d(xf) 

dx 


ihf 


from which we infer the fundamental commutation relation 


[x, p x ] = xp x - p x x = ih 1 (3.47) 

The identity operator 1 may be omitted and the fundamental commutation relations 
for this and the other canonically conjugate variables may be written as 

xp x ~ p x x = yp y - p y y = zp z - p‘z = ih (3.48) 

or, in commutator bracket notation, 


jX p x ] = ly, p y ] = [z, p z ] = ih 1 


(3.49) 


All other products formed from Cartesian coordinates and their conjugate momenta 
are commutative, i.e., xy = yz, xp y = p y x, xp z = p z x, and so on. 

In commutator bracket notation, for any operators the following elementary 
rules are easy to verify: 
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Exercise 3.12. If A and B are two operators that both commute with their 
commutator, [A, 2?],jirove that, for a positive integer n, 

[A, in = H-U fl 5 

[A", B] = nA n ~ l [A, B ] 

Note the similarity of the process with differentiation. Apply to the special case 
A — x, B = p x . Prove that 

ft df 

[p x , fix)] =- ~1 (3.52) 

i dx 

if /(jc) can be expanded in a power series of the operator x. 


Exercise 3.13. Show that each component of orbital angular momentum com- 
mutes with the kinetic energy operator, e.g., 


[L x , 7] - 


yPz ~ ZPy , — {pi + P 2 y 


+ pi) 


= o 


Exercise 3.14. Prove that 

4 (L) = — <r X VV> = <r X F> 
at 


(3.53) 


as expected from correspondence with classical mechanics. 

Another operator relation that is frequently used involves the function e A , de- 
fined by the power series 


- 


1 -\ 1 + h 


1! 2! 


3! 


Consider the function 

/(A) - e M Be~ M 

of a real number A, and make a Taylor series expansion of /(A), observing that 


~ = A/( A) - /(A)A = [A, /(A)] 

dA 

dA 2 


A & 
_ ’ dA 


= [A, [A, /(A)]] 


and so on. Since /( 0) = B, we get the identify 


e^ge-xA = B + A [A> B] + A_ [Af [A> B]] + [A> [A> [A , 5]]] + 


(3.54) 


If the operators satisfy the relation (J3: constant) 

[A, [A, B ]] = pB 


(3.55) 
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the right-hand side of (3.54) can be summed to give 


e KA Be-^ = B cosh AV/3 + 


[A, B\ 

V/3 


sinh aV/3 


A special case arises if (p: constant) 

[A, B]= pi 

leading from (3.56) to 

e^Be-^ = B + Ap 1 

Exercise 3.15. If [A, B] = y B ( y. constant) show that 

e kA Be~ kA = e ky B 


(3.56) 


(3.57) 

(3.58) 


(3.59) 


Equations like (3.54) caution us against the indiscriminate use of well-known 
mathematical rules, when the variables are noncommuting operators. In particular, 
the simple law e A e B = e A+B is generally not valid for operators. Its place is taken 
by the Campbell-Baker-Hausdorff formula, according to which in the equation 

e A g B = e A + B + F(A,B) 

the additional term F(A, B ) is expressible as a generally infinite sum of multiple 
commutators of A and B. 


Exercise 3.16. Consider the product G( A) = e kA e kB and prove that 

f =[a+S + ^[A,B]+^[A,[A,S ] ] + -"}g (3 60) 

= g|a + B + [A, B] + ^ [[A, B], B] + ■ ■ ■ J 

Show that if A and B are two operators that both commute with their commutator 
[A, B ], then 

e A e B = e A + B + lA,BV 2 ( 3 . 61 ) 


Another operator identity with many useful applications concerns differentia- 
tion of an operator that depends on a parameter t. The parameter could be the time 
t, but the formula is general and no commitment as to the nature of the parameter 
is required. If we suppose that A(t ) is an operator that depends on t, the derivative 
of e AW may be expressed as 


d_ 

dt 


e A<t) = 



N f a !* -1 

= lim 2) \ 1 + Tt 1 
k=] [ N J 





(3.62) 


where we use the notation 
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The sum in (3.62) can be turned into an integral over the real variable x, defined as 

n An 1 k Ak 1 

x = — , Ax — — — — or v — — , Ay = — = — 

N ) N N y N y N N 


so that 


~e Ai »= lim [1 + 
dt n — > co V 


-J f e-^ 0 A'COe^ dx 
NJ Jo 

= f e yA(t) A'(t)e~ yA(t) dy lim (l + 

Jo N — \ N I 


and finally, 


_ e m _ e A ( t ) 


dt 


| o A'(0e rfft) dx 

= e yA(0 A'(t)e~ yA(t) dy e A(,) 


(3.64) 


which is the desired result. 4 

Combining (3.64) with (3.54), and performing the x integrations term by term, 
we obtain 


— = a A(t)\ J_ A , 


1 


1 


dt 


e' — e 


1 ! 


A\t) ~ - [A(t), A\t )] + - [A(/), [A(0, A'm - 


2! 


3! 


(3.65) 


= A' if) + ~ [A(f), A'(f)] + ^ [A(0, [A(f), A'(r)]] + 


„A(t) 


Exercise 3.17. Prove the identity (3.64) by a second method, starting from 

the function F{ A) = -7- c ,L4(r) . Differentiate with respect to the variable A and derive 
dt 

a first-order differential equation for the function F. Then solve this equation. 


5. Stationary States and General Solutions of the Wave Equation. The Operator 
formalism provides us with the tools to construct formal solutions of the time- 
dependent Schrodinger equation simply and efficiently. The fundamental mathe- 
matical problem is to obtain a solution of Eq. (3.42), 

di h 

ih — = H<f (3.42) 

dt 

such that at t = 0 the solution agrees with a given initial state if/( r, 0). 

If the system is conservative and the Hamiltonian operator H does not depend 
on t, the solution of (3.42) may be written as 

i/Kr, t) = exp^- ^ Htj if(r, 0) (3.66) 

The simplicity of this formula is deceptive. The Hamiltonian is usually the sum of 
a differential operator (kinetic energy T) and a function of position (potential energy 
V), and e ~ lHtlh cannot be replaced by a product of operators, e ~ lTtlfi e~ lVt ' h . It is fair 


‘For a compilation of many applications of (3.64), see Wilcox (1967). 
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to say that much of the rest of this book will be devoted to turning Eq. (3.66) into 
a more explicit and calculable form for different dynamical systems. 

Here we make the more general observation that any initial state represented 
by an eigenfunction i// E ( r) of the Ham i ltonian operator, corresponding to an eigen- 
value E , gives rise to a particularly simple and important class of solutions (3.66). 
Eigenfunctions i/^(r) and eigenvalues EofH are defined by the eigenvalue equation 

HM r) = Ef E (v) (3.67) 


or, more explicitly, 


+ V(r)«fe(r) - Ef E (r) 
2m 


(3.68) 


known plainly as the Schrodinger equation. This equation is also called the time- 
independent Schrodinger equation in contrast to the time -dependent Schrodinger 
equation (3.1). 

Use of the symbol i/^ £ (r), or merely i/r(r), in the (time-independent) Schrodinger 
equation for a function that is in general entirely different from f(r, t) is sanctioned 
by usage. Using a different symbol might spare the student a little confusion now 
but would make the reading of the literature more difficult later. Whenever neces- 
sary, care will be taken to distinguish the two functions by including the variables 
on which they depend. 

If the initial state is represented by an eigenfunction if/ E (r ) of H, the time- 
dependent wave function becomes, according to (3.66), 


ip(r, t) = exp 



r) 


because for any analytic function / of H, 


(3.69) 


f(H)4r E ( r) = f(E),p E (r) (3.70) 

4 

The particular solution (3.69) of the time-dependent Schrodinger wave equation de- 
rives its great significance from the fact that more general solutions of (3.1) can be 
constructed by superposition of these separable solutions. 

If the states represented by (3.69) are to be physically acceptable, they must 
conserve probability. Hence, the probability density 


p = <A*(r, t ) tfr( r, t) = exp 


~j(E - E*)t 
n 


f E *(r)ifj E (r) 


must satisfy condition (3.10), and this is possible only if the eigenvalues E are real 
numbers. 

If E is real, the time-dependent factor in (3.69) is purely oscillatory. Hence, in 
the separable state (3.69) the functions if/( r, t) and f E (r) differ only by a time- 
dependent phase factor of constant magnitude. This justifies the occasional desig- 
nation of the spatial factor if/ E (r) as a time-independent wave function. (Often, for 
economy of notation, the subscript E will be dropped.) For such a state, the expec- 
tation value of any physical quantity that does not depend on time explicitly is 
constant and may be calculated from the time-independent wave function alone: 



^*(r, t)Aif/(r, t)dfr — 


J 


iJt E *(r)At// E (r)d 3 r 


(3.71) 
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From (3.67) we obtain 


(H) = J ifj E * (r)Htf/ E (r)d 3 r = E (3.72) 

showing that the eigenvalue E is also the expectation value of the energy in this 
state. 

We thus see s that the particular solutions 

t) = exp^-~ Et^jip E (r) (3.69) 

have intriguing properties. The probability density p = | ip( r, t) \ 2 and all other prob- 
abilities are constant in time. For this reason, wave functions of the form (3.69) are 
said to represent stationary states. Note that although the state is stationary, the 
particle so described is in motion and not stationary at all. Also, V • j = 0 for such 
a state. 

Exercise 3.18. Using Eq. (3.7), prove the orthogonality relation 

| ilr'f(r)>p 2 (r)d 3 r = 0 (3.73) 

for any two stationary states with energies E 1 ± E 2 and suitable boundary conditions. 


We have already encountered one special example of a stationary state. This is 
the state of free particle of momentum p and consequently of energy E = p 2 Hm, 
i.e., the plane wave 


i/f(r, t ) = exp 




P r 


This wave function represents a state that has the definite value E for its energy. 
There is no uncertainty in the momentum and in the energy of this plane wave state; 
but there is, correspondingly, a complete ignorance of the position of the particle 
and of the transit time of the particle at a chosen position. 

When conservative forces are introduced, the properties of momentum are, of 
course, drastically altered. But conservation of energy remains valid, as does the 
relation according to which the energy equals h times the frequency. Since a sta- 
tionary state (3.69) has the definite (circular) frequency <w = E/h , it follows that a 
stationary state is a state of well-defined energy, E being the definite value of its 
energy and not only its expectation value. This is to be understood in the sense that 
any determination of the energy of a particle that is in a stationary state always 
yields a particular value E and only that value. Such an interpretation conforms with 
the uncertainty relation, AEAt ^ h, which implies that a quantum state with a precise 
energy (A E = 0) is possible only if unlimited time is available to determine that 
energy. Stationary states are of just such a nature in view of the constancy of 1 1 //| in 
time. 

The observed discrete energy levels of atoms and nuclei are approximated to a 
high degree of accuracy by stationary states. The simple regularity conditions that 
physical considerations lead us to impose on the wave function give rise in many 
cases to solutions of (3.68), which, instead of depending continuously on the param- 
eter E, exist only for certain discrete values of E. The values of E for which (3.68) 



44 Chapter 3 The Schrodinger Equation, the Wave Function, and Operator Algebra 


has solutions consistent with the boundary conditions are said to constitute the en- 
ergy spectrum of the Schrodinger equation. This spectrum may consist of isolated 
points (discrete energy levels) and continuous portions. If there is a lowest energy 
level, the corresponding stationary state is called the ground state of the system. All 
discrete higher energy levels are classified as excited states. 

The careful phrasing concerning the approximate nature of the connection be- 
tween energy levels and stationary states is intentional. The point at issue here is 
that no excited state of an atom is truly discrete, stationary, or permanent. If it were, 
we would not see the spectral lines that result from its decay to some other level by 
photon emission. The excited states are nevertheless stable to a high degree of ap- 
proximation, for the period associated with a typical atomic state is of the order of 
the period of an electron moving in a Bohr orbit, i.e., about 10“ 15 sec, whereas the 
lifetime of the state is roughly 10 -8 sec, as will be shown in Chapter 19. Hence, 
during the lifetime of the state some 10 7 oscillations take place. The situation is 
even more pronounced in the case of the so-called unstable particles, such as the 
kaon or the lambda particle. Their natural periods are = 10” 21 sec, and their lifetimes 
= 10 -, ° sec. Thus, they live for about 10 11 periods. Comparing this with the fact 
that during its history the earth has completed some 10 9 revolutions around the sun — 
a motion that has always been considered one of the stablest — the striking, if par- 
adoxical, feature of the unstable particles is their stability! 

The stationary state wave functions are particular solutions of the wave equa- 
tion. More general solutions of (3.1) can be constructed by superposition of such 
particular solutions. Summing over various allowed values of E, we get the solution. 


«K r, t) = X c E exp 

E 



<Mr) 


(3.74) 


Here as elsewhere in this book the summation may imply an integration if part or 
all of the energy spectrum of E is continuous. Furthermore, the summation in (3.74) 
may include the same energy value more than once. This occurs if the Schrodinger 
equation, (3.67) or (3.68), has more than one linearly independent physically ac- 
ceptable solution for the same eigenvalue E. It is then hot sufficient to designate the 
eigenfunction by i^ e (r), and one or more additional labels must be used to identify 
a particular eigenfunction. The energy eigenvalue E is said to be degenerate in this 
case. 

The example of the free particle in one dimension illustrates these features. In 
this case (3.74) becomes 


’/'(r, 0 = 2 c E 


exp 




(3.75) 


There is degeneracy of degre e two here, because for any positive value of E, p can 
assume two values, p = ±V2 mE. The corresponding eigenfunctions are linearly 
independent. Since the energy spectrum is continuous and E can be any positive 
number, (3.75) must be regarded as an integral. Negative values of E, which cor- 
respond to imaginary values of p, are excluded, because <// would become singular 
for very large values of \x\. The exclusion of negative values of E on this ground is 
an example of the operation of the physically required boundary conditions. 

For a free particle in three dimensions, each positive energy E has an infinite 
degree of degeneracy, since plane waves propagating with the same wavelength in 
different directions are linearly independent. The eigenfunctions are exp(i p • r/h) 
and the eigenvalues are E — (p 2 x + py -F pf)/2m. 
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As a second-order linear differential equation, the Schrodinger equation in one 
dimension for an arbitrary potential V 

fi 2 d 2 if/ E (x ) 

7-5— + V(x)if/ E (x) = Eif/ E (x) (3.76) 

2m dx 

has two linearly independent solutions for each value of E. The Wronskian, 
~ l /4 1) ' of two such solutions of (3.76), t/4 n and if/*jP, is a nonzero con- 
stant for all x. 

Exercise 3.19. Prove that the Wronskian for two solutions of the one- 
dimensional Schrodinger equation is constant. 

Four possibilities can be distinguished: 

(a) Both solutions and their derivatives are finite at all x , as is the case for the 
free particle (V = 0) and for a particle in a periodic potential, if the energy is in an 
allowed band. The degeneracy is twofold. 

(b) One solution and its derivative approach zero as x approaches either + °° 
or —00, or both. The constancy of the Wronskian requires that the other solution 
diverges at large distance, so there is no degeneracy. If a solution approaches zero 
for both x — > ±00 sufficiently rapidly, it represents a bound state. 

(c) Both solutions diverge at large distance, for either x or — «>, or both. 

Neither solution represents a physically acceptable state. This is the case if E lies 
between discrete energy levels or, for a periodic potential, in a forbidden band. 

(d) Finally, for a periodic potential if the energy E is at a band edge, only one 
solution remains finite, while the other one diverges. 

In the preceding discussion, it was assumed that the coordinate x is the usual 
Cartesian coordinate ranging from — 0° to + «>. Sometimes we encounter systems 
that require the imposition of periodic boundary conditions on the solutions of the 
Schrodinger equation. For instance, this is the case for a particle that is constrained 
to move on a closed loop, such as a ring, and the boundary condition to be applied 
is the single-valuedness of the wave function. 5 Generally, the allowed stationary 
states will have twofold degeneracy. 

The potential function V(x) in the Schrodinger equation is usually smooth and 
well-behaved for most values of x. In many applications, however, V(x) is nonan- 
alytic at isolated points. In particular, we commonly encounter potential functions 
that are discontinuous or that have discontinuous slopes at some isolated points x 0 . 
If such a singular potential function can be approximated as the limiting case of a 
regular one, the wave function if/ and its first derivative 1 )/' must be continuous 
everywhere, since the Schrodinger equation is a second-order differential equation. 

More specifically, if x = x 0 is the point of singularity, we get by integrating 
the Schrodinger equation from x = x 0 — s to x — x 0 + s: 

f Xo+£ 2m 

ift'(x 0 + e) ~ if/' (xq - e) = [V(x) ~ E]if/(x) dx 

JXq— 8 fl 

As long as F(x) is finite, whether analytic or not, this equation implies that if/' is 
continuous across the singularity. Hence, if/ itself must also be continuous. 


5 For an analysis of the single-valuedness condition for wave functions, see Merzbacher (1962). 
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Thus, the joining condition to be assumed at a singular point of the potential, 
if V has a finite value, is that the wave function and its slope must be matched 
continuously. The probability current density, which is made up of i ft and i/d, must 
also be continuous — an evident physical requirement, if there are no sources and 
sinks of probability, i.e., of particles. 

Fourier analysis tells us that for any given initial wave packet, if{x, 0), the 
expansion coefficients in (3.75) are uniquely defined. Hence, t) in the form 
(3.75) is not only a solution of the wave equation 


ih — ~ - — — 
dt 2m dx 2 


(3.77) 


but it is the most general solution that can be of physical utility when describing the 
motion of a free particle. The generalization to three dimensions is straightforward. 

Similarly, if the Schrodinger equation for an arbitrary potential yields solutions 
«^ £ (r) that are complete in the sense that any initial state i(r( r, 0) of physical interest 
may be expanded in terms of them, i.e., 

<Kr, 0) = 2 r) (3.78) 

E 


then we automatically know the solution of (3.1) for all t: 



(3.79) 


Thus, i/f(r, t) is a superposition of stationary states, if the initial wave function can 
be expanded in terms of solutions of the Schrodinger equation. 

One more important simple property of the wave equation (3.1) can be discussed 
without specifying V in further detail. It is based on the observation that, if the 
complex conjugate of (3.1) is taken and t is replaced by —t, we get 

dih*(r — ti ft 2 

ih -r = VV*(r, ~t) + V(r)i/f*(r, -t) (3.80) 

dt 2m 

£ 

provided only that V is real. This equation has the same form as (3.1). Hence, if 
if/(r, t ) is a solution of (3.1), — t ) is also a solution. The latter is often referred 

to as the time-reversed solution of (3.1) with respect to ip(r, t). The behavior of the 
wave equation exhibited by (3.80) is called its invariance under time reversal. For 
a stationary state, invariance under time reversal implies that, if ^(r) is a stationary 
wave function, i/4(r) is also one. Hence, it follows that if iff E (r) is a nondegenerate 
solution of (3.68), it must be real, except for an arbitrary constant complex factor. 


Exercise 3.20. Prove the statements of the last paragraph, and show that if 
degeneracy is present, the solutions of the Schrodinger equation with real V may be 
chosen to be real. 


Most of the analysis in this section can be applied directly to the wave equation 
(3.21) in momentum space. The Fourier transform of the time-independent Schro- 
dinger equation (3.68) for stationary states of energy E is the integral equation 



(3.81) 
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The corresponding time-dependent momentum wave function that solves the wave 
equation (3.21) is 

<f>( P, 0 = e~ iEm MV) (3.82) 

For a free particle, (3.81) is simply an algebraic equation, which may be written 
as 

<M P) = 0 

This equation implies that ^(p) = 0 for all p, except if p 2 = 2 mE. In three dimen- 
sions, if p 0 is a fixed momentum with energy E = pl/2m, the momentum space energy 
eigenfunctions are then 

<K P) = S(p - p 0 ) (3.83) 

For a linear harmonic oscillator, with V(x) = mo/x 2 / 2, the Schrodinger equation 
in momentum space (3.80) reduces to (3.23), or 

1 , 1 2 d 2 <f) E (x ) 

— p 2 Ae(x) ~ ~ mo 2 - E4> e (x) (3.84) 

2m 2 dpi 

which is similar to the Schrodinger equation in coordinate space. Furthermore, the 
boundary conditions are analogous: </> must approach zero for large values of \p x \, 
just as is required to tend to zero as \x\ — » °°. Since in one dimension there is no 
degeneracy for bound states, it follows that for the energy eigenstates of the har- 
monic oscillator the Fourier transforms if/ E (x ) and 4>e(p x ) must be simply related to 
each other. 



Exercise 3.21. Show that for the stationary states of the harmonic oscillator, 


$e(Px) = 


,“*E 


' moi 




Px 

.met) , 


(3.85) 


which is a special case of the result in Exercise 3.8. 


Exercise 3.22. For real V, prove that the wave equation in momentum space 
is invariant under time reversal and that <£*(— p, —t) is the time-reversed solution 
with respect to <£(p, t ). 

6. The Virial Theorem. A simple example will illustrate the principles of the last 
two sections. Consider the equation of motion for the operator r • p. According to 
(3.44), 

ih = ' P ’ ^ 


By applying several of the rules (3.50) and the fundamental commutation relation 
(3.47), xp x — p x x — ih, we obtain 


u Px, m = 


2 „2 

X Px , ^ + ^ + 

2m 2m 2m 


= [ x , pi] ~ + x[p x , V] 

2m 

2 2 

= Pxix, Px ] ~ + [ X , p x ] ~ + x[p x , V] = ih — + x[p x , V] 
2m 2m m 


pi , v 


2 ' 
Px 

Tn +V \ 


XPx, ~ 
2m 


+ [xp x , V] 
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Similar relations hold for the y and z components. Combining these results, we get 


d . , 

i <r-p> = 



I 

I 


if/*r • {VW) - Wij/}d 3 r 
ijs*(r ■ VV)iJ/d 3 r 


(3.86) 


Exercise 3.23. Show that r • p — p • r = 3 ih 1, and prove that in spite of the 
noncommutivity of the operators r and p, their order on the left-hand side of (3.86) 
does not matter; the same equation holes for (p ■ r). 

As in classical mechanics, 6 we obtain from (3.86) for the time average over a 
time t 

- [<r • p>,=, - (r ■ p) t=Q ] = 2(f) ~ (r ■ VV> = 2<T) + (r • F) 

T 

If the expectation values in this expression remain finite as r — * °°, the left-hand 
side tends to zero, and 

2 <T> = (r • VV) = -<r^F) (3.87) 

For a stationary state all expectation values in (3.87) are constant in time, and 
it follows that 


2 (T) = (r • VV> = — <r • F) 


(3.88) 


The results (3.87) and (3.88) are known as the virial theorem. In a gas of weakly 
interacting molecules, departures from the ideal gas law can be related to the virial, 
2 r, • F,-, which explains the terminology. 


A 

Exercise 3.24. Apply the virial theorem for a stationary state to the three- 
dimensional anisotropic harmonic oscillator for which 

V(r) — ^ m((o \x 2 + (o\y 2 + ( ojz?) 

Show that the same result could have been derived from the equality given in Ex- 
ercise 3.21. 


Exercise 3.25. If the potential has the property V(Ax, Ay, A z) = A n V(x, y, z), 
using Euler’s theorem for homogeneous functions, show that, for a stationary state 

(T) - ^ <V> (3.89) 

Exercise 3.26. Apply the virial theorem to a central potential of the form 
V = Ar n , and express the expectation value of V for bound states in terms of the 
energy. 


6 Goldstein (1980), Section 3-4. 
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Problems 


1. If the state i/f(r) is a superposition, 

«A(r) = c^i (r) + c 2 i/f 2 (r) 


where i/q(r) and i^ 2 (r) are related to one another by time reversal, show that the 
probability current density can be expressed without an interference term involving 
and ifj 2 . i 

2. For a free particle in one dimension, calculate the variance at time t, (Ax)? = 
((x — (x) t ) 2 ), — { x 2 ) t — (x)? without explicit use of the wave function by applying 
(3.44) repeatedly. Show that 


(A x)? = (A x)l + - 


m 


- (xp x + p x x) o - (x)o (p x ) 


t + 


(A p x f 


m 


and 


(A p x ) 2 = (A p x f 0 = (tsp x f 

3. Consider a linear harmonic oscillator with Hamiltonian 

H=T+V=~+- mo) 2 x 2 
2m 2 

(a) Derive the equation of motion for the expectation value (x), and show that 
it oscillates, similarly to the classical oscillator, as 


(x), — (x)o cos cot + 


(Px) 0 

mco 


sin cot 


(b) Derive a second-order differential equation of motion for the expectation 
value (T — V), by repeated application of (3.44) and use of the virial theorem. Inte- 
grate this equation and, remembering conservation of energy, calculate (x 2 ),. 

(c) Show that 

(Ax)? = (x 2 ) t - (x) 2 = (Ax)o cos 2 cot + sin 2 cot 

m or 

+ 

Verify that this reduces to the result of Problem 1 in the limit a> —» 0. 

(d) Work out the corresponding formula for the variance (A p x ) 2 . 

4. Prove that the probability density and the probability current density at position r 0 
can be expressed in terms of the operators r and p as expectation values of the 
operators 

p(r 0 ) -> 8(r - r 0 ) j(r 0 ) -> ^ [pS(r - r 0 ) + S(r - r 0 )p] 

Derive expressions for these densities in the momentum representation. 

5. For a system described by the wave function i/j( r'), the Wigner distribution function 
is defined as 


- (xp x + p x x ) 0 - (x)o(p x )o 


sin 2 cot 


mco 


W(r', p') 


1 

( 27 rh) 3 


J e -i P ' r"/n 
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(In formulas involving the Wigner distribution, it is essential to make a notational 
distinction between the unprimed operators, r and p, and the real number variables, 
which carry primes.) 

(a) Show that W(r', p') is a real-valued function, defined over the six- 
dimensional “phase space” (r\ p'). 7 

(b) Prove that 

f W(r',p')d 3 p' = |<Kr')| 2 

and that the expectation value of a function of the operator r in a normalized state is 

</(r)> = | | /(r')W(r', p) d 3 r' d 2 p’ 

(c) Show that the Wigner distribution is normalized as 

| W(t', p') d 3 r’ d 3 p' = 1 

(d) Show that the probability density p(r 0 ) at position r 0 is obtained from the 
Wigner distribution with 8 

P (fo) -► f(r ) = S(r - r 0 ) 

6. (a) Show that if (f>( p') is the momentum wave function representing the state, the 
Wigner distribution is 

wv, p ) - ^3 J np' - p'w + pV- d 3 P " 

(b) Verify that 

■ | W(r',p')rfV = |^( P ')| 2 

and that the expectation value of a function of the operator p is 

<s(p)> g(p')W(r', P') d 3 r’ d3 p' 


’Although the integrals of W(r', p') over coordinate and momentum space are probability distri- 
butions in momentum and coordinate space, respectively, the function W(r', p'), which can take neg- 
ative values, is not a probability distribution (in phase space). 

8 Because it involves both r and p in the same expression, the analogous question for the proba- 
bility current density j is deferred until Problem 2, Chapter 15. 



CHAPTER 4 


The Principles of Wave Mechanics 


This chapter concludes our account of the general principles on which 
wave mechanics rests. The central issue is the existence of complete sets 
of orthonormal eigenfunctions of self-adjoint (Hermitian) operators, 
generalized to include the continuous spectrum. The momentum operator 
is related to coordinate displacements, and we encounter the first 
examples of symmetry operations. The formalism of nonrelativistic 
quantum mechanics is joined with the gauge-symmetric electromagnetic 
field. At the end, we should be ready for the application of quantum 
mechanics to spinless particles in one or more dimensions. 


1. Hermitian Operators, their Eigenfunctions and Eigenvalues. We have 
learned that every physical quantity F can be represented by a linear operator, which 
for convenience is denoted by the same letter, F. The expectation value of F is given 
by the formula 


(F) — if/*Fif/ d 3 r = J <f)*F<j) d 3 p 


(4.1) 


expressed either in the coordinate or the momentum representation. We now ask 
what general properties a linear operator must possess in order to be admitted as a 
candidate for representing a physical quantity. 

If (F) is the expectation (or average) value of a linear operator representing a 
physical quantity whose measured values are real, (F) must also be real. By (4.1) 
this implies that for all i fj which may represent possible states we must demand that 



(4.2) 


Operators that have this property are called Hermitian. The property of being Her- 
mitian is independent of the choice of representation. 

Exercise 4.1. Prove that if F is a Hermitian operator, (4.2) can be generalized 
to 


J (F iff 0*^2 d 3 r = j iff^Fif/2 d 3 r 

for any two possible states i/q and if/ 2 . 


(4.3) 


The momentum p is an example of a Hermitian operator. It was shown in Sec- 
tion 3.3 that for the calculation of the expectation value (p) the momentum may be 
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represented by — V if if/ vanishes sufficiently fast at large distances. But under the 
very same boundary conditions, integration by parts gives 



Hence, condition (4.3) is satisfied. [The momentum p is also represented by the 
differential operator (h/i)V if i jj satisfies periodic boundary conditions (Section 4.4).] 
Given an arbitrary, not necessarily Hermitian, operator F, it is useful to define 
its (Hermitian) adjoint F + by the requirement 

J f*F*g d 3 r = j (Ff)*g d 3 r (4.4) 

where f and g are arbitrary functions (but subject to the condition that the integrals 
exist). The existence of the linear operator F f for operators of the type F(r, p) can 
be verified by integration by parts. Comparing (4.4) with (4.3), we note that F is 
Hermitian if it is self-adjoint, 

F + = F (4.5) 

Conversely, if an operator F is Hermitian, and if its adjoint exists and has the same 
domain as F, we have 

J ( Fi(/i)*if/ 2 d 3 r = J ij/^Fif /2 d 3 r = J i/q*F + </f 2 d 3 r 

and hence F is self-adjoint. Since the physical interpretation requires that operators 
representing measurable physical quantities must be self-adjoint, it has become cus- 
tomary in quantum mechanics to use the terms Hermitian and self-adjoint synony- 
mously. We shall follow this usage, although it glosses over some mathematical 
distinctions that are addressed in more thorough presentations. 1 

A number of simple theorems about operators can now be given. Their proofs, 
if not trivial, are indicated briefly: 

1. The adjoint of the sum of two operators equals the sum of their adjoints: 
( F + G)^ = F + 4- G". The sum of two Hermitian operators is Hermitian. 

2. The identity operator 1, which takes every function into itself, is Hermitian. 
If A is a real number, A 1 is Hermitian. 

3. If F is non-Hermitian, F + F + and i(F — F + ) are Hermitian. Hence, F can 
be written as a linear combination of two Hermitian operators: 


F + F f 

F - F f 

F = 

+ i 

2 

2 i 


( 4 . 6 ) 


'Many mathematically unimpeachable treatments have been published since the first appearance 
of von Neumann (1932). For a compact account, see Jordan (1969) and the references cited here. For 
a more recent discussion, see Ballentine (1990). 
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4. If F and G are two arbitrary operators, the adjoint of their product is given 
by 


(FG) f - G + F + 


(4.7) 


with an important reversal of the order of the factors. 
Proof 1 


J ip*(FG) f ip d 3 r = J (FGij/)*^ d 3 r = J (Gi/r)*FV d 3 r = J i /r*G t F t ^ d 3 


But i p is arbitrary, hence (4.7) follows. If F and G are Hermitian, 

(FG) f = GF (4.8) 

Corollary. The product of two Hermitian operators is Hermitian if and only if they 
commute. 


5. The adjoint of a complex number A is its complex conjugate A*. 

6. All the operators listed in Table 3.1 representing physical quantities are 
Hermitian. 

Since the weak requirement of being Hermitian is sufficient to establish the 
most important properties common to the operators that represent physical quanti- 
ties, the generic term dynamical variable is often used for such operators. Thus, 
x and p x are dynamical variables, but their product xp x is not, because the two op- 
erators fail to commute (see property 4). 

Exercise 4.2. Show that the fundamental commutation relation (3.47) for x 
and p x is consistent with the Hermitian nature of the operators. 


Exercise 4.3. From the time-dependent Schrodinger equation (3.42), prove 
the equation of motion (3.44) for the expectation value of an operator F, making 
use only of the Hermitian property of H but not of its explicit form. 

It is not farfetched to suppose that the failure of x and p x to commute is con- 
nected with the uncertainty relation for x and p x and with the incompatibility of 
precise simultaneous values for the x coordinate and the x-component of momentum. 
Given any dynamical variable A, how do we find and characterize those particular 
states of the system in which A may be said to have a definite value? If we make 
the convention that specific numerical values of dynamical variables (physical quan- 
tities) will be denoted by primed symbols (e.g.. A') to distinguish them from the 
operators representing physical quantities (e.g., A), the same question can be phrased 
in physical terms: If we measure A in a large number of separate systems, all replicas 
of each other and each represented by the same wave function if/, under what con- 
ditions will all these systems yield the same sharp and accurately predictable value 
A'? Phrased still differently, what kind of i jj corresponds to a probability distribution 
of the value of A that is peaked at A ' and has no spread? 
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In the particular state if/, in which the outcome of every measurement of A is 
the same, A', the expectation value, { f(A )), of any function of A must be equal to 
f(A'). Hence, we must demand that, in such a state. 


</(A)> = /(A') 

(4.9) 

for any function of A. In particular, for /(A) = A we demand 


(A) = A' 

(4.10) 

and for /(A) = A 2 we require that 


(A 2 ) = A' 2 = (A) 2 

(4-11) 


According to Eq. (A. 35) in the Appendix, (4.11) expresses the vanishing of the 
variance (A A) 2 of A: 

(A A) 2 = ((A - (A)) 2 ) = (A 2 ) ~ (A) 2 


If if/ is assumed to be quadratically integrable, this condition implies for the special 
state in which A has a sharp value A ' that 

j if/*(A - A’) 2 if/d 3 r = 0 

The operator A, being a dynamical variable, is Hermitian, and A' is real. Hence, the 
last equation becomes 

I [(A - A')if/]*(A - A')ip d 3 r = J |(A - A')if/\ 2 d 3 r = 0 
from which it follows that 


Aif/ = A' if/ 


(4.12) 


A quadratically integrable function if/ that satisfies (4.12) is an eigenfunction of A, 
and the real number A' is the corresponding eigenvalue. All eigenvalues of A belong 
to its spectrum. An eigenfunction of A is characterized by the fact that operating 
with A on it has no effect other than multiplication of the function by the eigenvalue 
A'. It may be noted that (4.10) and (4.11) are sufficient to satisfy condition (4.9) 
for any function / of a dynamical variable. 

The eigenvalues are the same in the coordinate or momentum representations; 
the corresponding eigenfunctions are Fourier transforms of each other. This follows 
from the linearity of Fourier transforms. 

The Hamiltonian H representing the energy is an example of a dynamical vari- 
able. If the system has a classical analogue, the operator is obtained from the clas- 
sical Hamiltonian function of q' s and p’s by replacing these physical variables by 
operators. Its eigenfunctions if/ E (r) or <f> E ( p) represent the states in which 
measurement of the energy gives the certain and sharp values E. (We follow the 
customary usage of denoting the eigenvalues of H by E, rather than H', which 
consistency in notation would suggest.) 

Our conclusion is that a system will reveal a definite value A', and no other 
value, when the dynamical variable A is measured if and only if it is in a state 
represented by the eigenfunction if/ A <. We often say that the system is in an eigenstate 
of A. The only definite values of A which a system can thus have are the eigenvalues 
A'. 
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The reality of the eigenvalues of Hermitian operators needs no further proof, 
but a most important property of the eigenfunctions, their orthogonality, does. Ac- 
tually, the two properties follow from the same simple argument. Consider two ei- 
genfunctions of A, if/ x and t f/ 2 , corresponding to the eigenvalues AJ and A' 2 : 

A <Ai = Alfa (4.13) 

A iff 2 = A 2 ifj 2 (4.14) 

Multiply (4.13) bn the left by i)j 2 ; take the complex conjugate of (4.14), and multiply 
it on the right by ip x . Then integrate both equations over all space and subtract one 
from the other. Owing to the Hermitian property of A, 

(A[ - A 2 *) I d 3 r = 0 (4.15) 

If if/ 2 - ipu then A 2 = A[, and hence A[* = A[, which demonstrates again that all 
eigenvalues of A are real. Using this result, we then see from (4.15) that if 
A 2 # A[ the two eigenfunctions are orthogonal in the sense that 

J iff 2 *ifti d 3 r — 0 (4.16) 

We conclude that eigenfunctions belonging to different eigenvalues are orthogonal. 

Since the eigenvalues A ' are real, Eq. (4.15) is trivially satisfied if two different 
eigenfunctions belong to a particular eigenvalue of A. An eigenfunction that is ob- 
tained from another one merely by multiplication with a constant is, of course, not 
considered “different.” Rather, to be at all interesting, all the n eigenfunctions 
belonging to an eigenvalue A ' must be linearly independent. This means that none 
of them may be expressed as a linear combination of the others. 

Although any two eigenfunctions belonging to the same eigenvalue may or may 
not be orthogonal, it is always possible to express all of them as linear combinations 
of n eigenfunctions that are orthogonal and belong to that same eigenvalue (the 
Schmidt ortho gonalization method ). 

Proof. Suppose that 

Aif/\ = A' iff 1 and Aifs 2 = A' if/ 2 

where if/ 2 is not a multiple of 1 i/q, and where the overlap integral of ifj x and 1 fj 2 is 
defined as 

j if/*ifj 2 d 3 r = K ¥= 0 

By interpreting 1 p x and ift 2 as vectors in a two-dimensional space, and J if/* ift 2 d 3 r as 

their scalar (Hermitian inner) product, we can depict the Schmidt procedure in Figure 
4.1. We construct a new vector t ff' 2 as a linear combination of ip x and 1 f/ 2 and demand 
that 



If iff x and i)j 2 are assumed to be normalized to unity, if/ 2 , also normalized, is 

W Vl - K 2 


(4.17) 
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Figure 4.1. Two-dimensional (real-valued) vector analogue of Schmidt orthogonalization. 
The unit vectors represent the (generally complex-valued) functions i fa, fa, and fa. The 
scalar product of the vectors t fa and fa equals K, the length of the projection. 


This new eigenfunction of A is orthogonal to fa . If there are other eigenfunctions 
of A which correspond to the same eigenvalue A ' and are linearly independent of 
fa and fa, this process of orthogonalization can be continued systematically by 
demanding that fa be replaced by fa, a linear combination of fa, fa, and fa, so that 
1^3 is orthogonal to fa and t fa, and so on. When the process is completed, any 
eigenfunction of A with eigenvalue A ' will be expressible as a linear combination 
of fa, fa, fas , ... , fa n . This orthogonal set of functions is by no means unique. Its 
existence shows, however, that there is no loss of generality if we assume that all 
eigenfunctions of the operator A are mutually orthogonal. Any other eigenfunction 
of A can be written as a linear combination of orthogonal ones. Henceforth we will 
therefore always assume that all the eigenfunctions of A are orthogonal and that the 
eigenfunctions, which are quadratically integrable, have been normalized to unity. 

Since 


jfafad 3 r=8 iJ 

the solutions of the equation 

A fa = A- fa 


(4.18) 


(4.19) 


are said to form an orthonormal set. In writing (4.18) and (4.19) we are, therefore, 
allowing for the possibility that two or more eigenvalues of A may be the same. If 
this happened, we speak of a repeated eigenvalue; otherwise we say that the eigen- 
value is simple. The possible occurrence of repeated eigenvalues is not always made 
explicit in the formalism but should be taken into account when it arises, as it 
frequently does. In the case of the energy operator, energy levels that correspond to 
repeated eigenvalues of the Hamiltonian H are said to be degenerate. 
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2. The Superposition and Completeness of Eigenstates, We have seen the phys- 
ical significance of the eigenstates of A. What happens if the physical system is not 
in such an eigenstate? To find out, let us first look at a state that is a superposition 
of eigenstates of A, such that the wave function can be written as 


<A = 2 c ^i 

i 


(4.20) 


Owing to the orthonormality of the eigenfunctions, (4.18), the coefficients c, are 
related to if by 


c ( = J if/* if/ d 3 r 

and the normalization integral is 


(4.21) 



(4.22) 


The expectation value of A in the state if is given by 


(A) = J if*Aif d 3 r = 2 A't | Ci \ 2 (4.23) 

It is important to remember that certain eigenvalues of A appearing in this sum may 
be repeated. The sums of all the |c, | 2 which belong to the same eigenvalue of A will 
be symbolized by 2* |c ; p. 

More generally, for any function of A, 


</(A)> - 2 /(A ;)|c, | 2 

i 


(4.24) 


of which (4.22) and (4.23) are special cases. 

Exercise 4.4. Prove (4.21), (4.22), and (4.23). 

From these equations we may infer that the eigenvalues of A ' which are char- 
acteristic of the operator A, are in fact the only values of A that can be found in the 
measurement of this physical quantity, even if if is not an eigenstate of A. If this 
interpretation is made, then it follows that for simple eigenvalues, | c, | 2 , which de- 
pends on the state, is the probability of finding the value A- when A is measured. 
This conclusion comes about because two probability distributions of the random 
variable A j which produce the same expectation values for any function of A' must 
be identical; hence, the probabilities |c, | 2 are uniquely determined. If if happens to 
be an eigenstate, i f h of A, then c, = 1, and all other Cj = 0 if j 4* i. In this particular 
case, (4.24) agrees with (4.9), showing the consistency of the present interpretation 
with our earlier conclusions. If an eigenvalue of A is repeated, the probability of 
finding it is the restricted sum 2/ k*-| 2 . 

Although this interpretation is natural, it is still an assumption that a sharp line 
can be drawn between the dynamical variables of the system, which determine the 
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possible results of a measurement, and the state of the system, which determines the 
actual outcome of the measurement, at least in a probability sense. 

Equation (4.20) shows how this division of the theory into physical quantities 
(operators) and particular states (wave functions) with its attendant consequences 
for the interpretation of A • and |c, | 2 is related to the possibility of superposing states 
by constructing linear combinations of eigenfunctions. The state ip is in a certain 
sense intermediate between its component states t p t . It bears the mark of each of 
them because a measurement of A in the state i p may yield any of the eigenvalues 
A’i which correspond to eigenfunctions ip t represented with nonvanishing contribu- 
tions in the expansion (4.20) of if. 

The interpretation of the formalism is incomplete, however, until we are assured 
that the wave function ip of an arbitrary state can always be written as a linear 
combination of the eigenfunction of A and that there are no physical states which 
cannot be so represented. 

Cogitation alone cannot provide this assurance, but experience has shown that 
it is legitimate to assume that any state can be represented to a sufficient approxi- 
mation by a superposition of eigenstates of those dynamical variables that are ac- 
tually observed and measured in experiment. Mathematically, the superposition 
appears as a linear combination of the appropriate eigenfunctions, so that we may 
generalize (4.20) and write for any state ip: 

oo 

= 2 CM (4.25) 

i— 1 


where the sum on the right side includes all physically acceptable eigenfunctions of 
A. Since this infinite sum does not generally converge pointwise, the sense in which 
it represents the wave function ip must be clarified. If (4.25) is to be treated as if 
the sum were finite, so that integrations, summations, and differentiations may be 
liberally interchanged, it is sufficient to require that the sum 2 cM converge to a 

i 

state ip in accordance with the condition 


lim 

n—> oo 




(4.26) 


This kind of convergence, which is far more permissive than point-by-point uniform 
convergence, is known as convergence in the mean. The sum 2 cM may be said 
to approximate ip in the sense of a least-square fit at all coordinate points. 


Exercise 4.5. If a set of n orthonormal functions ip t is given, show that the 

n 

superposition 2 cM which best approximates a state (pin the sense of a least-square 

t=i 

fit and which minimizes the integral 


L = \ 


*P ~ 2 c i4fi 


i= 1 




(4.27) 


corresponds to the choice (4.21) for the coefficients c,. Prove that for this optimal 
condition Bessel’s inequality 



2 kl 2 

;= i 


(4.28) 
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holds. As n is increased and a new n + 1-st function is included in the superposition 
of orthonormal functions, note that the first n coefficients c t retain their values and 
that /„ > I n+1 . 

If in the limit n — > <» the sequence /„ approaches zero, the set of orthonormal 
functions represents ip in the sense of (4.26) and we have for all ip Parseval’s 
formula, * 



(4.29) 


This equation is called the completeness relation and expresses the condition that 
an operator A must satisfy if its eigenfunctions ipi are to be sufficient in number to 
represent an arbitrary state. Such a set of eigenfunctions is said to be complete. For 
a large class of simple operators — notably those known to be of the Sturm-Liouville 
type — the completeness of the eigenfunctions can be proved rigorously, but quantum 
mechanics requires the use of many far more general operators for which the com- 
pleteness proof is hard to come by, and one has to proceed heuristically, assuming 
completeness unless proven otherwise. 

The assumption that every physical operator possesses a complete set of or- 
thogonal eigenfunctions is spelled out in the expansion postulate of quantum me- 
chanics: 

Every physical quantity can be represented by a Hermitian operator with 

eigenfunctions ip u ip 2 , . . . , ip n > and every physical state by a sum 2 

where the coefficients are defined by (4.21). 

Following Dirac, we sometimes call an Hermitian operator that possesses a complete 
set of eigenfunctions an observable. 

According to the discussion at the beginning of this section, if A\ is a simple 
eigenvalue, jc, | 2 is the probability of finding the value A' for the physical quantity 
A. If A ■ is repeated, the probability is the restricted sum 2« ! c /| 2 - The coefficients 
c t are called probability amplitudes. They are determined by the state according to 
(4.21) and conversely determine the state fully [Eq. 4.20)]. 

In the last section, we interpreted functions as vectors in order to give a geo- 
metric interpretation of the Schmidt orthogonalization procedure. More generally, 
the expansion of an arbitrary wave function in terms of an orthonormal set of ei- 
genfunctions is reminiscent of the expansion of a vector in terms of an orthonormal 
set of basis vectors. The expansion coefficients are analogous to vector components, 
but the dimensionality of the vector space is no longer finite. The integrals in Eqs. 
(4.18), (4.21), and (4.22) can be thought of as inner products, akin to the scalar 
products for ordinary vectors, and (4.22) and (4.29) represent the formula for the 
square of the “length,” or norm, of the “vector” ip. Linear operators are like tensors 
of rank two, which act on vectors to produce other vectors. 

This geometrically inspired approach of working in a generalized vector (or 
Hilbert) space, suitably extended to complex-valued vectors, underlies the general 
theory of Chapters 9 and 10. It owes its power and aesthetic appeal to its indepen- 
dence of any particular representation. 
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3. The Continuous Spectrum and Closure. In the last section, we have assumed 
that the eigenfunctions of the Hermitian operator A representing a physical quantity 
are countable, although they may be infinite in number. It is essential to remove this 
limitation, since many physical quantities, such as the linear momentum of a particle, 
have a continuum of values. The chief obstacle that restricts the eigenvalue spectrum 
to discrete eigenvalues is the requirement (4.18) of quadratic integrability of eigen- 
functions. 

If quadratic integrability is too narrow a restriction to impose on the eigen- 
functions of A, what should be the criterion for selecting from the solutions of the 
equation 

Aip A ‘ = A’fa (4.30) 


those eigenfunctions that make up a complete set for all physical states? 

The subscript A ' has been attached to the eigenfunctions in the continuum to 
label them. As before, the discrete eigenfunctions are labeled as fa corresponding 
to the eigenvalue A •. 

Boundedness of the eigenfunctions everywhere is usually, but not always, a 
useful condition that draws the line between those solutions of (4.30) that must be 
admitted to the complete set of eigenfunctions and those that must be rejected. Gen- 
erally, it is best to assume that the expansion postulate holds and to let nature tell 
us how wide we must make the set of eigenfunctions so that any physically occurring 
i jj can be expanded in terms of it. We thus tailor the mathematics to the requirements 
of the physics, the only assumption being that there is some fabric that will suit 
these requirements. 2 

If the probability interpretation of quantum mechanics is to be maintained, all 
eigenvalues of A must be real, and eigenfunctions that are not quadratically integra- 
te can appear in the expansion of a quadratically integrate wave function iff only 
with infinitesimal amplitude. Hence, these functions are part of the complete set 
only if they belong to a continuum of real eigenvalues, depend on the eigenvalue 
continuously, and can be integrated over a finite range of eigenvalues. Thus, (4.25) 
must be generalized to read 


r 

*i> = 2 C i^i + C A'^A' dA' 

i 


(4.31) 


In the notation it is tacitly assumed that for a continuous eigenvalue A ' there is only 
one eigenfunction. If there are several (possibly infinitely many) linearly indepen- 
dent eigenfunctions corresponding to an eigenvalue A', more indices must be used 
and summations (or integrations) carried out over these. 

We now extend to the continuum the fundamental assumptions formulated in 
the expansion postulate and generalize (4.24) by requiring that 

</(A)> = j ip*f(A)ifj dfr = 2 |c,-| 2 f(A'i) + J |c A f f(A') dA' (4.32) 

Thus, | c A ' | 2 in the continuum case is the probability density of finding A to have the 
measured value A'. More precisely, [c A '| 2 dA' is the probability that a measure- 
ment of A yields values between A' and A’+dA'. Substitution of (4.31) into 


2 See references in footnote 1. 
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d 3 r shows that the right-hand side of (4.32) is obtained only if, for the 


continuous spectrum, we have the orthogonality condition 


I i d 3 r = 8(A' - A") 


(4.33) 


and if the continuous and discrete eigenfunctions are orthogonal to each other. We 
say that the eigenfunctions t jj A > are subject to A-normalization if (4.33) holds. This 
equation expresses orthogonality and delta-function normalization of the continuum 
eigenfunctions. With the requirement that the eigenvalues must be real, it can be 
merged into the single condition 


j <i. A d 3 r - j (At/v)*</V d 3 r = (A" - A')5(A' - A") (4.34) 

But the right-hand side of this equation vanishes, since xSQt) = 0. Hence, whether 
or not the eigenfunctions are quadratically integrable, we must demand that, the 
Hermitian property of A, 



(4.35) 


should hold for the physically admissible eigenfunctions, in the usual sense of equa- 
tions involving delta functions. The orthonormality condition (4.33) permits eval- 
uation of the expansion coefficients in (4.31), giving 


c A ' = 



(4.36) 


in close analogy with (4.21). 

We remark that the derivations in the last paragraph succeed only if the order 
of the integrations over space and over the range of eigenvalues A' can be inter- 
changed. The license for such interchanges is implicit in the extended expansion 
postulate and is assumed without further notice. 

To keep the notation compact, we often write the equations of quantum me- 
chanics as if all eigenvalues were discrete, the summations over the eigenvalues 
implying integrations over any continuous portions of the spectrum. This convention 
is suggested by the formal similarity of the expansion equations 

4* = 2 Ci = j fi*fid 3 r (4.37) 

for the discrete, and 

= J c A -fi A , dA\ c A . = J d 3 r (4.38) 

for the continuous spectrum. 

Moreover, it is always possible to make ail eigenfunctions quadratically inte- 
grable, and the entire spectrum discrete, by applying boundary conditions that con- 
fine the particle to a limited portion of space. For instance, impenetrable walls at 
great distances can be erected, or periodicity in a large unit cell can be required. 
The imposition of such boundary conditions “quantizes” the continuous spectrum 
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of A. The spacing between the discrete eigenvalues A\ decreases as the confining 
volume grows, and in the limit of infinitely receding boundaries, portions of the 
discrete spectrum go over into a continuous one. The transition is made by intro- 
ducing a density of (discrete) states, p(A'). This is the number of eigenstates fa per 
unit eigenvalue interval: 

A N 

P(A') - (4.39) 

In the limit of an infinitely large confining region, the density p becomes infinite, 
since then the eigenvalues are truly continuous. But usually p is proportional to the 
volume of the enclosure, and it is then possible to speak of a finite density of states 
per unit volume. 

It is easy to see that consistency between the expansion formulas (4.37) and 
(4.38) is achieved if the relations 

<Aa' = faV^A 7 ), c A . = CiVrfA 7 ) (4.40) 


are adopted. Similarly, 

S(A' - A") = p(A')8jj (4.41) 

relates the discrete Kronecker delta to the “continuous” delta function. Hence, if 
the equations are written with the notation of the discrete spectrum, it is a simple 
matter to take proper cognizance of the equally important continuous spectrum, with- 
out which the set of eigenfunctions of an operator A would not be complete. Many 
of the features of the continuous spectrum will be illustrated in Section 4.4. 

A useful condition that a given set of orthonormal functions must satisfy, if it 
is to be complete can be derived from the identity 


<A(r) = S c t fa(r) = ^ if/fr) [ d 3 r' 

i i J 

= J js (r')<Mr)J<Kr') d 3 r' ^ 


Since this must be true for any function </f(r), we infer that 


2 <A*(r')<Mr) = 5(r- r') 


(4.42) 


which is known as the closure relation. If the set of orthonormal functions included 
in the sum is incomplete, the sum will fail to represent the delta function. 

Exercise 4.6. We know that |^,(r)| 2 is the probability density in coordinate 
space if the system is known to be in the i-th eigenstate of A. Use the closure relation 
to deduce, conversely, that the same quantity measures the (relative) probability of 
finding the system to yield A ■ in a measurement of A, if the system is known to be 
at position r. 

4. A Familiar Example: The Momentum Eigenfunctions and the Free Particle. 
In the coordinate representation, the linear momentum p is represented by the op- 
erator p — > hV/i. Its eigenvalue equation 

h 

t Vi// p (r) = piAp(r) 


(4.43) 
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has the orthonormal eigenfunctions 


<A P (r) = 


,/p-r/ft 


(2 7th) 


3/2 


(4.44) 


and eigenvalues -°o < p x , p y , p z < + 00 - Each of the three momentum components 
supports a spectrum of repeated eigenvalues, but the plane waves (4.44) are simul- 
taneous eigenfunctions of all three components, and the three eigenvalues of p x , p y , 
and p z together completely specify the eigenfunction. The occurrence of simulta- 
neous eigenfunctions of commuting operators, like the three momentum components, 
to specify a complete set of eigenfunctions will be discussed in detail later, but the 
explicit example of (4.44) already illustrates the issues involved. 

The expansion postulate implies that an arbitrary wave function can be written 
in terms of momentum probability amplitudes (f> (p) as 




and that the expansion coefficient is given by 

1 




(4.45) 


(4.46) 


(2-tt^) 3/2 

These are, of course, just the standard formulas of Fourier analysis (Section 3.3). 
The orthogonality of the eigenfunctions is expressed by 


1 


| e mx p— p') -- d s r = s( p - p ') 


(4.47) 


(2 TTflf 

which is an integral representation for the delta function (see Appendix, Section 1). 
This equation shows that (4.44) is given in the p-normalization. 

If, in the above formulas, we set p = hk and then choose units so that h = 1, 
the eigenfunctions are said to be given in the k-normalization, and Fourier trans- 
forms like (2.16) and (2.17) are obtained. 


Exercise 4.7. 
relation (4.42). 


Show that the eigenfunctions of momentum satisfy the closure 


In one dimension it is sometimes convenient to use a closely related set of 
eigenfunctions. These are the eigenfunctions of the free particle Hamiltonian, 



h 2 d 2 
2m dx 2 


(4.48) 


They are doubly degenerate for each energy E > 0: 

if/ E (x) = A(E)e ikx and B(E)e~ ikx 

corresponding to the two directions in which the particle can move. The nonnegative 
quantum number k is related to the energy by 

hk = V2 mE 


The energy normalization is determined by the conditions 

f A*(E)A(E')e Kk '- k)x dx — 8(E — E') (4.49) 
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and 


r. 


B*(E)B(E')e 


-i(k'~k)x _ 


dx= S(E - E’) 


(4.50) 


Using the properties of the delta function (compiled in the Appendix, Section 1), 
we can write (4.49) as 


27r|A(£)| 2 


h 


V2m 


8(VE - VE t ) = 8{E - E') 


If E + 0, the identity (A. 32) in the Appendix, 

8(Ve - Ve 7 ) = 2 VES(E - E') 

gives the normalization constant 

fm lml 


|A(£)| 2 = 


2E lirh hp hv 


(4.51) 


where v — p!m is the particle speed. A similar calculation gives \B(E)\ 2 . Hence, if 
the phase constants are chosen equal to zero, the energy-normalized eigenfunctions 
for the free particle in one dimension are ( k > 0) 

/ \ 1/4 


ifj E (x) = 


m 


,%ir 2 fi 2 E 


„±.ikx 


(4.52) 


Any two eigenfunctions (4.52) with different signs in the exponent (representing 
waves propagating in opposite directions) are orthogonal, whether or not they belong 
to the same energy. 

The theory of Fourier series teaches us that there are alternate methods of con- 
structing complete sets of discrete quantized energy eigenfunctions for the free par- 
ticle Hamiltonians. In three dimensions, the eigenvalues of the Hamiltonian 



are defined by imposing certain boundary conditions on the eigenfunctions 

Mr) = C( k>‘ kr 

A particularly useful set is obtained if we require the eigenfunctions to satisfy pe- 
riodic boundary conditions, such that upon translation by a fixed length L in the 
direction of any one of the three coordinate axes all eigenfunctions of H shall assume 
their previous values. This periodicity condition restricts the allowed momenta or 
wave vectors to 


277 

277 

277 

li 

| 

II 

H 

K = T n z 


(4.53) 


where n x , n y , and n z are integers (positive, negative, or zero). The wave functions 
may be normalized in the basic cube of side length L; thus: 


^ [ f I <Ak( r )| 2 dx dy dz = 1 

0 Jo Jo 




4 A Familiar Example: The Momentum Eigenfunctions and the Free Particle 


65 


Hence, 


«K(r) - L _3/2 e ik r 
and the energy eigenvalues are the discrete numbers 

ft 2 !? 27 r 2 h 2 . 


(n x + nz, + nf) 


(4.54) 

(4.55) 


Since each allowed energy value can be obtained by a number of different choices 
of the quantum numbers n x , n y , n z , there is, except for E — 0, considerable degen- 
eracy here. As the energy E increases, the number of ways in which three perfect 
squares can be added to yield 


n 2 + n 2 + n 2 


mEL 2 

2ir 2 h 2 


(4.56) 


rises rapidly. The number of states (4.54) in the volume L 3 Ap x A p y Ap z of phase 
space is A n x An y An z , and according to (4.53) we have 


L 3 A p x A p y A p z = ^ 
An x A n y A n z 


(4.57) 


In statistical mechanics, this last equation is somewhat loosely interpreted by saying: 
“Each state occupies a volume h 3 in phase space.” The number, AN/AE, of eigen- 
states per unit energy interval for any but the lowest energies is to good approxi- 
mation 



Exercise 4.8. Use the Euclidean space in which lattice points are labeled by 
the integer coordinates n x , n y , n z to derive the (asymptotic) formula (4.58) for the 
density of states 3 . Obtain a formula for the cumulative number Z of states whose 
energies are less than or equal to E. Compute the number of ways in which one can 
add the squares of three integers (positive, negative, or zero) such that their sum, 
n 2 + n 2 + n 2 , successively assumes values no greater than 0, 1,2,..., 100. Com- 
pare the numerical results with the asymptotic formula. 

Exercise 4.9. Prove that the free particle energy eigenfunctions, which are 
subject to periodic boundary conditions, satisfy the orthogonality conditions appro- 
priate to a discrete eigenvalue spectrum, provided that the integrations are extended 
over the periodicity volume. 

In one dimension, free particle energy eigenfunctions that are subject to periodic 
boundary conditions give rise to an energy spectrum 

(4.59) 



3 Brehm and Mullin (1989), pp. 82-84. 
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and each eigenvalue, except E = 0, is doubly degenerate. The density of states is 
approximately 

AN _ 2 1 An | ^ | n 
~AE ~ A E ~ ~E 

The main difference between this density of states and its three-dimensional ana- 
logue (4.58) is that it decreases with energy as 1/Ve, whereas the latter is propor- 
tional to Ve. This different behavior is due to the fact that the degree of degeneracy 
of each energy value is independent of E, and equal to 2, for one dimension, whereas 
in three-dimensional space it increases linearly with E. 

Closely related to the foregoing formulation of the free particle problem is the 
problem of the “particle in a box,” familiar from every introductory course in 
quantum physics. In Section 6.4 we will return to this problem as a special case of 
a one-dimensional potential square well of infinite depth, but here we simply assume 
zero potential energy inside a finite volume at whose boundaries V jumps to an 
infinitely high value. Since the expectation value of the potential energy must be 
finite, the wave function must vanish within and on the walls of the box. As a 
convenient example, we choose the box to be a cube of length L, situated in the first 
coordinate octant with one corner at the origin. Inside the box, the Schrodinger 
equation is 

h 2 

- — V z) = Ei(f(x, y, z) (4.61) 

2m 

In a box geometry, the solutions are obtained by separation of variables. Thus, we 
set 



>P(x, y, z ) = X(x)Y(y)Z(z) (4.62) 

and find that the Schrodinger equation separates into three equations: 

r] 2 Y h 2 y d 2 7 

— I + k 2 x X= 0, + k%Y = 0, —j + e z Z= 0 (4.63) 

dx dy dz 

i 

and 


fijp' fj ^ 

2m 2m y z 

The eigenfunctions that satisfy the boundary conditions 

<A(0, y, z) = i KL, y z) = ip(x, 0, z) = ih(x, L, z) = ip(x, y, 0) 


i AC*, y, L) = 0 


are 


g ^ U £ TT 

if/(x, y, z) — C sin k x x sin k y y sin k z z — C sin - £ — x sin - 2 — y sin - 5 — z (4.64) 

L L L 


with t x , t y , t z satisfying the condition 

92 


p2 _i_ p2 , pi _ 2mEL 

C x + -ty + *C Z ^2^2 


(4.65) 


The values of i x , t z are restricted to be positive integers, since a change of sign 
in (4.64) produces no new linearly independent eigenfunction. Hence, the degree of 
degeneracy of any given energy value is less here by a factor of 8 than in the case 
of periodic boundary conditions. However, by virtue of the different right-hand sides 
of (4.56) and (4.65), for a given energy E there are now more ways of finding suitable 
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values to satisfy (4.65). These two effects compensate each other, and as a result 
the density of states is the same for the case of periodic boundary conditions and 
the box. In either case, the eigenfunctions constitute a complete set, in terms of 
which an arbitrary wave packet can be expanded by a Fourier series within the basic 
periodicity cube. 

This comforting result is an example of a very general property of the density 
of states: It is insensitive to the specific details of the boundary conditions and is 
dependent only* on the size (volume) of the region of integration. 4 

Exercise 4.10. For the energy eigenfunctions in the one-dimensional box (Fig- 
ure 4.2), compute the first few terms in the closure relation (4.42). Show how better 


8 mcfiE 
ffi- 



Figure 4.2. The three lowest energy eigenvalues (£ x = 1,2,3) and the corresponding 
eigenfunctions for a one-dimensional box of dimension L = 2a. The unit of energy on the 
ordinate is h 2 /8ma 2 . The maximum amplitude of the normalized eigenfunctions is VI la. 


4 Courant and Hilbert (1953). 
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>8 

ipproximations to the delta function <5(x — x') are obtained as the number of terms 
s increased. 

>. Unitary Operators: The Displacement Operator . As representatives of phys- 
cal observables, Hermitian operators play a paramount role in quantum mechanics, 
>ut unitary operators are of comparable importance. The time development operator, 
vhich links the initial state of a system at t — 0 to its state at an arbitrary time t, as 
'ormalized in (3.66), is unitary. Unitary operators also serve to transform quantum 
nechanical states that describe equivalent physical situations into one another. They 
ire, therefore, at the heart of every analysis that seeks to take advantage of the 
symmetry properties of the physical system under consideration. A systematic dis- 
mission of symmetry in quantum mechanics is taken up in Chapter 17, but it is 
instructive to illustrate the underlying ideas by some simple examples in the realm 
if one-particle wave mechanics. 

A unitary operator U is defined by the condition that it must have an inverse 
and preserve the norm of all state vectors: 

(4.66) 

From (4.66) it follows that a unitary operator preserves the inner (Hermitian 
scalar) product of any two states: 

I [UMr)]*Utfr 2 (r) d 3 r = J ^(r)«A 2 (r) d 3 r (4.67) 

Exercise 4.11. Prove (4.67) from (4.66) by letting if/ = ^ + c 2 ip 2 , for any 

choice of the coefficients Cj and c 2 . 

By the definition of the adjoint operator U r , Eq. (4.67) can also be written as 

j ^(r)U f Ui)j 2 (r) d 3 r = j r)]*0 2 (r) d 3 r 

so that a unitary operator is one that satisfies the relation 

UlP = iPU = 1 1 (4.68) 

Unitary operators, like self-adjoint (Hermitian) operators, can be assumed to have 
complete sets of orthonormal eigenfunctions, since for two eigenfunctions i/q and 
ijf 2 of a unitary operator, corresponding to the eigenvalues U[ and U 2 , the equations 

j mxTUfa d 3 r = U[*U' 2 j d 3 r (4.69) 

and 

(1 - U[*U' 2 ) J d 3 r = 0 

follow. Hence, reasoning analogous to that given for Hermitian operators leads to 
the conclusion that all eigenvalues of a unitary operator have an absolute value equal 
to unity: 



TV. = e ia J 


fa.: real) 


(4.70) 
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and eigenfunctions belonging to different eigenvalues are orthogonal. Again, as for 
Hermitian operators, linearly independent eigenfunctions belonging to the same ei- 
genvalue may be chosen to be orthogonal. 

An example of a useful unitary operator is the displacement or translation op- 
erator D defined by its action on any wave function i}r. 


DpK*) = - 0 

i. 


(4.71) 


where £ is a real number. Although this definition of the displacement operator is 
for wave functions in one dimension, the generalization to higher dimensions is 
immediate. Figure 4.3 illustrates the transformation. 

Since i/s (x — £) is normalized to the same value as i fr(x), the operator D 6 is 
unitary. Successive application of two displacement operators and D v leads to 

D v D^(x) =*(*-£- 77) = D € D v Mx) = D s+V ijr (x) (4.72) 


Hence, displacement operators have the property 

DgD v = D V D S = Dg +V 


(4.73) 


which expresses the fact that the product of two displacements is again a displace- 
ment. Such closure under multiplication is the chief characteristic of a group. The 
additional properties defining a group are the existence of an identity : 


£>o = 7 (4.74) 

and an inverse of every element: 

DJ l = D_ f (4.75) 

such that 

Dj'Df = £> 0 = 7 (4.76) 

Finally, the multiplication of the elements of a group must be associative : 

D c (D e D v ) = (D ( D e )D v (4.77) 

which is true for displacement operators, since both sides of this equation are equal 
to D £+ £ +v . The translation group is said to be Abelian because its elements commute, 
but it should be noted that this is not a general requirement for groups. 



Figure 4.3. Linear displacement or translation of a function i jj(x) — > D f i f/(x) = ^(x — £) 
by a displacement vector £ in one dimension. The transformation of the function is active. 
If £ = vt, this figure illustrates an active Galilean transformation (Section 4.7) for a 
positive value of t. 
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From the basic property of the displacement operator it is seen that for an 
arbitrarily small displacement e, to first order: 

D e = 1 — iek (4.78) 

where k must be a Hermitian operator to ensure the unitarity of D t 

Exercise 4.12. Prove that if Dg is to be unitary, it is necessary that k be 
Hermitian. 


Combining an arbitrary displacement £ with a small displacement e, we can 
write: 


Z7g +e — DgD e — D^(l iek ) 

from which we infer the differential equation 

(4.79) 

dD* D f+ - 

= lim € = -iD 6 k 

d(^ s — >0 & 

With D 0 = 1 for £ = 0, (4.80) has the solution 

(4.80) 

Dg = e~ ik6 

Since 

(4.81) 

, £ , g 2 „ 

/ d \ 

D^(x) = i \ff(x - £) = i (f(x) - ip (x) + ~ if/ (x) ± . . . 

it is seen that, in the coordinate representation, 

1 d 

. = exp(~£-j Hx) 

(4.82) 

k = 

i dx 

(4.83) 


which is proportional to the linear momentum operator p x . The Hermitian operator 
k is said to be the generator of infinitesimal translations. 

The eigenvalues and eigenfunctions of the unitary operator Dg can be deter- 
mined from its representation in terms of the Hermitian operator k. For the latter we 
already know that the eigenvalue equation 

kf(x) = = k'f(x) (4.84) 

l dx 

is satisfied by the eigenfunctions 

f(x) = e ik ' x (4.85) 

The eigenvalue k' of the operator k may be any real number. It cannot have 
an imaginary part, because the eigenfunctions would then diverge for x — >■ + <» or 
x —> — 00 and not represent possible physical states. 

Since Dg is related to the operator k by Eq. (4.81), the eigenvalues of the dis- 
placement operator are expressed as 

D' s = e~ ik ' € (4.86) 

from which it is seen that the same eigenvalue results if A:' is replaced by k' + 
Irm! £, where n is any integer. Thus, we can limit ourselves to values of k' in an 
interval of length 2 tt/£, such as ~tt/£ < k' s +tt/£ which is referred to as the first 
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Brillouin zone. The degenerate eigenfunctions of corresponding to the eigenvalue 
D'j: = exp(— ik'i;) are 

f n (x ) = e l( - k ' +2 ™'£> x ( n: any integer) 

Hence, the linear combination 


g(x) = 



c gW +27Tn/ & x 


Ak'x 



c e i2irrn/ € x 


(4.87) 


is the most general eigenfunction of Dg corresponding to the eigenvalue (4.86). The 
coefficients c n are arbitrary. The eigenfunctions 

g(x) = e ik ' x u(x) (4.88) 

are therefore products of plane waves and arbitrary periodic functions u(x) of x: 

u(x — €) = u(x) (4.89) 

with period £. Functions that have the structure (4.88) are known as Bloch functions. 


Exercise 4.13. By requiring that g(x) defined by (4.88) be an eigenfunction 
of D%, with eigenvalue D' € = e~ lk ' s , verify the periodicity property (4.89) directly 
from the definition of D f as a displacement operator. 


Since displacements in three dimensions commute and may be carried out 
equivalently in any order, it is evident that, in three-dimensional space, Eq. (4.81) 
is generalized to 

D & = e~ ikS = exp {~i{k'A + kyV + KO) (4.90) 


where the displacement vector 8 is represented by its three Cartesian components 
(£, rj, £). The generator of infinitesimal displacements 



(4.91) 


is proportional to the momentum operator p. The generalization of the concepts of 
Brillouin zones and Bloch functions to three dimensions involves no new ideas. It 
forms the backbone of every analysis of the physics of periodic lattices, such as 
crystalline solids. 


Exercise 4.14. If d t , d 2 , d 3 are the primitive translation vectors of a three- 
dimensional infinite lattice whose points are at positions R = n x d x + n 2 d 2 + « 3 d 3 
(n, = integer), show that any simultaneous eigenfunction of all the translation op- 
erators £> r = e -,k ' R corresponding to eigenvalues e~ ,k R is of the form u(r)e ,k r , 
where w(r) is an arbitrary periodic function of r over the lattice, i.e., u ( r + R) = 
w(r), for every R. Note that any two values of k' which produce the same eigenvalue 
e~' k ‘ R differ by a reciprocal lattice vector G, defined by the condition G • R = l-nn, 
where n is any integer. 


6. The Charged Particle in an Electromagnetic Field and Gauge Invariance. 
The Hamiltonian operator considered so far and listed as the total energy in Table 
3.1 had the simple form 
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corresponding to a conservative system with potential energy function V , such as a 
charged particle in an electrostatic field. But classical Hamiltonian mechanics is not 
limited to such systems, and neither is quantum mechanics. If a particle of charge 
q moves in an external electromagnetic field , which may be time-dependent, the 
classical equations of motion may be derived as Hamilton’s equations from a Ham- 
iltonian 




+ qfi 


(4.93) 


Here and A are the scalar and vector potentials of the electromagnetic field, re- 
spectively, both of which are now assumed to be given functions of r and t. The 
validity of this Hamiltonian operator in quantum mechanics is supported by a cal- 
culation of the time rate of change of the expectation value of r for the particle: 

4 <r> = ^ <fr. «]> = - (p - 1 A) (4.94) 

dt in m \ c / 


Evidently, this equation defines the velocity operator: 


1 

v = — 

L- q -A) 

m 

\ c / 


(4.95) 


and this relation between the canonical momentum p and the velocity v, in the 
presence of a vector potential A, has the same form in quantum mechanics as in 
classical mechanics. 

From (4.95) we infer that the components of the velocity operator generally do 
not commute and that, since the magnetic field B — V X A, 


v X v — — %r (p X A + A X p) = ih -y- (V X A) = ih B (4.96) 


m 2 c 


m c 


me 


Newton’s second law in quantum mechanical form is 


d . , 1 /r . /dv\ 1 

1, < T> = a <[¥ - m + \W = a 


m 

V, - V • V 


1 . . q /dA' 

+ -r ([v, q<f>]) ( — 

in me \ dt / 


The right-hand side can be evaluated by using the operator identity 

[v, V • v] = V X (v X v) - (v X v) X V 


(4.97) 


Exercise 4.15. Prove the identity (4.97) for any operator v whose components 
may not commute. 


Combining (4.96) and (4.97) we find 


m — (v) — — (v X B — B X v) — q{5J 4>) 
dt 2c 




If we recall that the electric field E is 
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we arrive at the quantum mechanical version of the equation for the acceleration of 
the particle in terms of the Lorentz force: 


d / \ 
m ~(y ) 


s_ 

2c 


(v X B — B X v) + g(E) 


(4.98) 


In all of this, the cross products of two vectors require careful treatment because 
noncommuting factors are involved. 

At this stage, it is appropriate to discuss the choice Of the electromagnetic 
potentials in quantum mechanics. Since 

V X (V/) = 0 and V — = - V/ 

J dt dt 

the fields E and B are unchanged by the gauge transformation on the potentials: 


A' = A + V/ (f> (f> 1 = (j) — 


19 / 

C dt 


(4.99) 


where /( r, t ) is an arbitrary differentiable function of r and t. The primed and 
unprimed potentials are said to represent two different but equally valid gauges. In 
the new primed gauge, the Hamiltonian is 

H'=-(p--A'\ + q<f>’ (4.100) 

2m \c/ 

The time-dependent Schrodinger equation in the old gauge is 

ih ( V " fc A ) * (r ’ 0 + q4> * {T ' 0 (4 - 101) 

and in the new gauge it has the form 

ih = ( V " fc A ') ^' (r ’ f) + 0 (4-102) 

where the new potentials A' and are related to the old potentials by (4.99). The 
new wave function i f/ r is related to the old wave function by the unitary phase 
transformation, 

if/’( r, t ) = U>f/( r, t) (4.103) 

with 


U = O/ftc 


(4.104) 


If the phase is a constant, we speak of a global gauge transformation. Since multi- 
plication by a constant phase factor e ia is unimportant in ordinary quantum me- 
chanics where ip is a probability amplitude, global gauge transformations are not 
interesting here. In quantum field theory, however, where particles can be created 
and destroyed, these transformations have significance. 

To prove that (4.104) constitutes a local gauge transformation, we merely need 
to substitute t p' as given by (4.104) in (4.102) and deduce that this equation is 
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identical to (4.101) if the Hamiltonians (4.93) and (4.100) in the two gauges are 
connected by 


H' = UHU f + ih — W 
dt 


(4.105) 


To check the validity of this condition, we use the definition (4.99) and the com- 
mutation relations to verify that the gauge transformation (4.104) leaves the oper- 
ators r and v (but not p) invariant: 


UrU f = r 







(4.106) 


and that, further. 


* dU , 
ih — If = 
dt 


qdf_ 
c dt 


(4.107) 


Exercise 4.16. Carry out the details of the gauge transformation explicitly to 
show that (4.105) holds with any unitary transformation of the form (4.104). 


The representation of the canonical momentum operator p, as distinct from the 

ft 

kinetic momentum m\ - p — - A, has been kept the same, p = — V, for all choices 

c i 

of the gauge. The fundamental commutation relations are gauge-invariant, and the 
gauge invariance of the theory finds its expression in the relation 




if = 



~ c A' JUift = ijj'* 




r 


(4.108) 


This connection ensures the gauge invariance of the exp'ectation value of the kinetic 
momentum. 


iq 

Exercise 4.17. Show that the substitution V V — — A in (3.6) produces a 

he 

gauge-invariant current density and that this new j satisfies the continuity equation 

dp 


dt 


+ V • j = 0 


for the Schrodinger equation (4.101) in the presence of an electromagnetic field. 


Although we have assumed in the foregoing discussion that an electromagnetic 
field is actually present, gauge invariance considerations apply equally if the particle 
is free (zero E and B fields) and the potentials satisfy the conditions 

1 9A 

V X A = 0 and V<£ H = 0 

c dt 

Under these conditions, the trivial gauge </> = 0, A = 0 can be chosen. Other gauges 
are perfectly acceptable, and the canonical and kinetic momentum do not necessarily 
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have to be equal. But it is generally assumed that in the absence of electric and 
magnetic fields, or if the particle is neutral, the choice <p - 0, A = 0 has been made, 
fixing the most natural gauge. 

Exercise 4.18. For a charged particle in a uniform static electric field E, show 
the connection between the two gauges: — - E • r, A = 0, and — 0, A' = 
— cEt. Derive the gauge transformation function /( r, t ) and the two Hamiltonians. 
Obtain the equations of motion for the expectation values of r and p in the two 
gauges and compare their solutions. 

Exercise 4.19. Show that the Schrodinger equation in the presence of a static 
magnetic field is not invariant under time reversal. 

The main lesson to be learned from this analysis of gauge invariance is that the 
wave function for a particle is dependent on the explicit or implied choice of a gauge 
and that the potentials A and must be known if the meaning of if/ is to be properly 
communicated. In particle physics, the invariance of the quantum mechanical de- 
scription under multiplication of all wave functions by a spacetime-dependent phase 
factor exp [ iq f{ r, t)/hc] is elevated to a fundamental principle of local gauge sym- 
metry. Its imposition necessarily requires that, in addition to a wave function if/, the 
state of the system be described by a gauge field represented by A and </>. In this 
view, the electromagnetic field is regarded as a manifestation of local gauge sym- 
metry. The extension of these ideas to particles with additional (internal) degrees of 
freedom will be sketched in Section 17.10. 

7. Galilean Transformations and Gauge Invariance. The translations consid- 
ered in Section 4.5 were purely spatial displacements and had nothing to do with 
time. We now turn to transformations that involve uniform motion with constant 
velocity v. Using the tools developed in Section 4.5, we attempt to relate the state 
if/(r, t) of a system at time t to the state if/ v ( r, t ) of the same system, after it has been 
boosted so that it is moving with velocity v. According to the principle of relativity, 
the absolute uniform motion of a closed physical system is undetectable. In nonre- 
lativistic physics we speak of the Galilean principle of relativity. There is every 
reason to believe that the principle of relativity is valid in quantum mechanics, as 
it is in classical physics. If the wave function were itself an observable physical 
quantity (which it is not), we would expect that the boosted state was just if/( r — \t, 
t ) for all times t. For the simple example of a free particle state represented by a 
plane wave with momentum p, 

if/( r, t) = e i<p - r ~ Em = g ,(kr_a,f) (4.109) 

This would imply that the boosted plane wave state is 

lf/{ r - Vf, t) = e ilP -(r-vU-aiM = gi[k-r— ai«— k-vf] (4.110) 

This expression is plainly wrong, since it implies that the momentum and the cor- 
responding de Broglie wavelength are the same for the boosted system as for the 
system at rest, while the associated frequency has been (Doppler) shifted from cu to 
(o + k • v. Instead, we expect that in the boosted state the momentum is p + m\ and 
the energy E v = E + p • v + mv 2 / 2, corresponding to a frequency on + k • v + 
mv 2 /2h, where |v| = v. 
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To be in compliance with the Galilean principle of relativity and obtain a correct 
wave function for the boosted state if/ v ( r, t), we must multiply ip(r - vt) by an 
appropriate spacetime-dependent phase factor: 


<A v (r, t) = e i(mv r ^ mv2t/2 > /n l jj(r - \t, t) 


(4.111) 


The phase factor in (4.112) depends on v, but not on p and E. 


Exercise 4.20. Check that (4.111) gives the correct plane wave function for 
the boosted state. 


More generally, we find that if iKr, 0 satisfies the wave equation (4.101): 

-12 


ifi 


dt/>( r, t) 
dt 


2m 


V - ^ A(r, 0 
he 


t ) + q<f( r, O'Mr, 0 (4.101) 


the wave function if/(r — vt, t ) satisfies 

2m 

+ q<f)( r - vt, t)*K r - vt, t ) - ihv • V<Mr - vt, t) 


.„ di//(r - vt, t ) 

in 

dt 


V - ^ A(r ~ vt, t) 
nc 


«K r - vt, t) 


where A(r - vt, t) and (f> (r - vt, t) are the boosted potentials acting on the boosted 
quantum system. The extra term on the right-hand side shows that this is not the 
correct wave equation and that <//(r — vt, t) is not the correct wave function for the 
boosted state. We may rewrite the last equation as 


diKr ~ vt, t) 

in 

dt 


2m 


V — — - A(r - vt, t) + ~ mv 
nc n 


-12 


*M r “ vt, t) 


+ 


9 <^(r - vt, t) - 


mu 


(4.112) 


*A(r - vt, t) 


If we invert (4.111): 

<Kr - vt, t) = e~ Kmv ^ mv2t/2Vh ijj v (r, t) 


and substitute this into (4.112), we obtain the correct wave equation for the boosted 
state: 


ih 


dif/ v (r, t ) _ ti 1 


dt 


2m 


V — A(r - vt, t) 
nc 


-12 


<A v (r, 0 + q(j)(r - vt, f)iM(r, t) (4.113) 


The transformation (4.111) describes the relation between two different but 
equivalent states of a system or two replicas of a system. Such a boost is called an 
active Galilean transformation (Figure 4.3). It is to be distinguished from a trans- 
formation between two descriptions of the same state of one system, viewed from 
two different frames of reference, which are in relative motion. This transformation 
is called a passive transformation. 

To show how a passive transformation works, we start again with the wave 
function i(f( r, t) characterizing the given state in frame-of-reference S with coordi- 
nates x, y, z, and position vector r. We assume that another frame of reference 
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<A f 



Figure 4.4. Passive Galilean transformation, x' — x — vt, of a wave function, effected by 
the mapping ift{x) — » fix') = fix' + vt). Compare with the active transformation shown 
in Figure 4,3. 


S'(x', y', z ') is in; uniform translational motion with velocity v relative to S. A 
Galilean coordinate transformation (Figure 4.4) is given by 

r' = r — \t and t' = t (4.114) 

Here the second, trivial, equation has been introduced to reduce the likelihood of 
mathematical errors in carrying out the transformation. It also prepares the way for 
use of Lorentz rather than Galilean transformations, in which case the time trans- 
forms nontrivially. From (4.114) we deduce that 

d S 

- = — - v • V' and V = V' (4.115) 

dt dt' 


Hence, if we define the potentials in S' as 

A'(r', t') = A(r, t) = A(r' + \t' , t') and 4>'(r', t') = <f>(r, t ) = <£(r' + vt', t') 
the wave equation (4.101) in S is transformed into 

h 2 


dif/(r' + vt', t') 

in : 

dt' 


2m 


V' - ^ A'(r\ t') - j mv if/(r' + vt' , t') 
nc n 


+ 


q<f>'( r\ t') 


mv 


<Kr' + vt' , t') 


Since this equation in the primed frame does not have the same form as (4.101) in 
the unprimed frame, the Galilean principle of relativity appears to be jeopardized. 
However, the simple gauge transformation that defines the wave function iff' it' , t') 
in 5', 


lf/'(r', t') = e -'<™vr+,mA/2)/^( r , + t = e — i(mv- r — mv 2 r/2)/h ^ t ) (4.1 16) 


restores the expected manifest Galilean form invariance: 


ih 


diff'(r',t') 

dt' 


2m 


~l2 


V' 


iq_ 

he 


A'(r ',t') 


iff'ir', t') + qcj>'(r', t')iff'(r', t') 

(4.117) 


Nonrelativistic quantum mechanics is thus seen to be consistent with the Galilean 
principle of relativity owing to the gauge dependence of the wave function. 

It is remarkable that if the momentum and energy in the free particle wave 
function are regarded as components of a relativistic four-vector, the plane wave is 
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invariant under Lorentz transformations, and there is no additional phase or gauge 
transformation. In Chapters 23 and 24 we will return to these issues in the context 
of relativistic quantum mechanics. 

Exercise 4.21. If the plane wave (4.109) describes a free particle state in the 
frame of reference S, show that the passively transformed wave function if/'(r ' , t ') 
correctly accounts for the momentum of the same particle in frame of reference S' . 
This result implies that the de Broglie wavelength is different as viewed from two 
frames, which are in relative motion. Compare and contrast the Galilean transfor- 
mation of if/ waves with the analogous transformation of classical waves. (See also 
Section 1.2.) 

Exercise 4.22. In frame S, consider a standing-wave state if/ that is a super- 
position of a plane wave (4.109) and a plane wave of equal magnitude and arbitrary 
phase but opposite momentum, as might be generated by reflection from an impen- 
etrable barrier. Show that the nodes in the wave function if/ are separated by half a 
wavelength. If the same state is viewed from a moving frame S', show that the two 
counter-propagating waves have different wavelengths, but that the spacing of the 
nodes nevertheless is the same as in S. 


Problems 

1. Show that the addition of an imaginary part to the potential in the quantal wave 
equation describes the presence of sources or sinks of probability. (Work out the 
appropriate continuity equation.) 

Solve the wave equation for a potential of the form V — - V 0 (l + if), where V 0 
and f are positive constants. If f « 1, show that there are stationary state solutions 
that represent plane waves with exponentially attenuated amplitude, describing ab- 
sorption of the waves. Calculate the absorption coefficient. 

2. If a particle of mass m is constrained to move in the xy plane on a circular orbit of 
radius p around the origin 0, but is otherwise free, determine the energy eigenvalues 
and the eigenfunctions. 

3. A particle of mass m and charge q is constrained to move in the xy plane on a circular 
orbit of radius p around the origin 0, as in Problem 2, and a magnetic field, repre- 
sented by the vector potential A = <bk X r/[27r(k X r) 2 ], is imposed. 

(a) Show that the magnetic field approximates that of a long thin solenoid with 
flux placed on the z axis. 

(b) Determine the energy spectrum in the presence of the field and show that it 
coincides with the spectrum for <E> = 0 if the flux assumes certain quantized values. 
Note that the energy levels depend on the strength of a magnetic field B which differs 
from zero only in a region into which the particle cannot penetrate (the Aharonov- 
Bohm effect). 



CHAPTER 5 


The Linear Harmonic Oscillator 


In solving the Schrodinger equation for simple problems, we first assume 
that i[/ and V depend only on the x coordinate. The Hamiltonian for a 
conservative system in one dimension is 

H = ~ + V(x) (5.1) 

2m 

The eigenvalue equation for this Hamiltonian, in the coordinate 
representation, is the ordinary Sturm-Liouville differential equation 

h 1 2 d 2 th(x ) 

~ r- -7T 2 + V(x)i/,(x) = EfCx) (5.2) 

2m ax 

We will solve this Schrodinger equation for several special forms of the 
potential V(x). The harmonic oscillator stands out because in many 
systems it applies to motion near a stable equilibrium. It governs the 
dynamics of most continuous physical systems and fields, has equally 
spaced energy levels, and provides the basis for the description of many- 
particle states. In this chapter, we discuss the oscillator in the coordinate 
representation, leaving for later (Chapters 10 and 14) the more general 
representation-independent treatment. 


1 . Preliminary Remarks. For the linear harmonic oscillator, the potential energy 
term in the Schrodinger equation is 


V(x) — - mo) 2 x 2 


(5.3) 


We call <y loosely the (classical) frequency of the harmonic oscillator. In the neigh- 
borhood of Jt : — a, an arbitrary potential V(x) may be expanded in a Taylor series: 

V(x) = V(a) + V'(a)(x - a) + ^ V"(a)(x - a) 2 + ■■■ 

If x — a is a stable equilibrium position, V{x) has a minimum at x = a, i.e., 
V'{a) = 0, and V"{a) > 0. If a is chosen as the coordinate origin and V(a) as the 
zero of the energy scale, then (5.3) is the first approximation to F(x). A familiar 
example is provided by the oscillations of the atoms in a diatomic molecule, but 
harmonic oscillations of various generalized coordinates occur in many different 
systems. 

The linear oscillator is important for another reason. The behavior of most 
continuous physical systems, such as the vibrations of an elastic medium or the 
electromagnetic field in a cavity, can be described by the superposition of an infinite 
number of spatial modes, and each expansion coefficient oscillates like a simple 
harmonic oscillator. In any such physical system, we are then confronted by the 
quantum mechanics of many linear harmonic oscillators of various frequencies. For 
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this reason, all quantum field theories make use of the results that we are about to 
obtain. 

Two parameters, the mass m and the frequency o, appear in the Hamiltonian 
(5.1) with potential (5.3). They may be used to fix units of mass (m) and time (l/o>). 
In quantum mechanics, Planck’s constant provides a third dimension al num ber and 
thus allows us to construct a characteristic energy (hco) and length (Vnkw). In the 
interest of accentuating the great generality of the linear harmonic oscillator, we 
employ these scaling parameters to replace the canonical coordinate x and momen- 
tum p x by the dimensionless variables 


1 = 



(5.4) 


and 



In terms of the new variables, the Hamiltonian is 


(5.5) 




If the Hamiltonian is similarly replaced by the dimensionless quantity 


(5.6) 


H 


1 


1 


»t = jr = ipi + ir 

nu> 2 2 

and the time by the dimensionless variable 

T — (i)t 

Hamilton’s equations for the system reduce to the usual form 


d A 

dr 


dpt 


dpj 

dr 


dH s 


(5.7) 


(5.8) 


(5.9) 


Exercise 5.1. Show that Eqs. (5.9) are the correct equations of motion for the 
linear harmonic oscillator defined by (5.1) and (5.3). 


The fundamental commutation relation 


xp x — Px x — iftl 

(5.10) 

is transformed by (5.4) and (5.5) into 


IPs “ Pd = u 

(5.11) 

The eigenvalue problem for is expressed as 


= ei ft 

(5.12) 

and the energy E is related to e by 


E = ehoi 

(5.13) 
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The Schrodinger equation for the linear harmonic oscillator is transformed into 

~^+{2e- e)*=0 (5.14) 

By using the same symbol if/, in both (5.2) and the transformed equation (5.14) for 
two different functions for the variables x and £, we indulge in an inconsistency 
that, though perhaps deplorable, is sanctioned by custom. 

If a power series solution of this equation is attempted, a three-term recursion 
formula is obtained. To get a differential equation whose power series solution ad- 
mits a two-term recursion relation, which is simpler to analyze, we make the sub- 
stitution 

■Kl) = e-t 2,2 v(£) (5.15) 


This yields the equation 


(fv 

df 2 


dv 

2£ — - + 2 nv — 0 
d£ 


where n is defined by the relation 

1 

e = n + — 
2 


(5.16) 


(5.17) 


or 



(5.18) 


Exercise 5.2. Substituting a power series with undetermined coefficients for 
if/ and v into (5.14) and (5.16), obtain the recursion relations and compare these. 


2. Eigenvalues and Eigenfunctions. One simple and important property of the 
harmonic oscillator derives from the fact that V is an even function of x, if the 
coordinate origin is chosen suitably. Generally, if the potential energy satisfies 
the condition V(-x) = E(x), and if i f/(x) is a solution of the Schrodinger equation 
(5.2), then it follows that if/(~x) is also a solution of this equation. The Schrodinger 
equation with even V(x) is said to be invariant under reflection, for if x is changed 
into —x, the equation is unaltered except for the replacement of if/(x ) by if/(—x). Any 
linear combination of solutions (5.2) also solves (5.2). Hence, if if/(x) is a solution 
of the Schrodinger equation, the following two functions must also be solutions: 

fft e (x) = ^ [i f/(x) + if/(-x)] i f/ 0 (x) = ^ [if/(x) - if/(-x )] 

These are the even ( e ) and odd (o) parts of < f/(x), respectively. Thus, in construct- 
ing the solutions of (5.1) for even E(x), we may confine ourselves to all even and 
all odd solutions. A state which is represented by a wave function that is even, 
ifj e (—x) — i f/ e (x), is said to have even parity. Similarly, we speak of odd parity if 
fpo(~x) = -i ft a (x). 


Exercise 5.3. Extend the notion of invariance under reflection (of all three 
Cartesian coordinates) to the Schrodinger equation in three dimensions. Show that 
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if V depends merely on the radial distance r, only solutions of definite parity need 
be considered. 


Since £ is proportional to x, Eqs. (5.14) and (5.16) are invariant under reflection 
£ — > — £, and we need to look only for solutions of definite, even or odd, parity. For 
a linear second-order differential equation like (5.16), it is easy to construct the even 
and odd solutions, v e and v 0 , by imposing the boundary conditions: 

Even case: u e (0) = 1, v' e (0 ) = 0 
Odd case: u o (0) = 0, n'(0 ) = 1 

By substituting 

v e (£) = 1 + «2 £ 2 + + • ■ * 


or 


v 0 (€) = € + « 3^ 3 + « 5^ 5 + • • * 

into (5.16) and equating the coefficient of each power of £ to zero (see Exercise 
5.2), we obtain the power series expansions 


2 n 


«,© = 4! 


2 2 2 n{n - 2) t4 _ 2 3 n(n - 2 )(n - 4) + 


r - 


6! 


v„(M) = £ ~ 


2 (n - !) 3 + 2 2 (w - 1)(» - 3) 


(5.19) 


3! 


5! 


The rule that governs these expansions is evident. 

In order to determine if such a solution is quadratically integrable and can 
describe a physical state, we must consider the asymptotic behavior of v e and v Q . 
How do these functions behave for large values of | £|? For the purpose at hand, we 
need not become involved in complicated estimates. Unless n is an integer both 
series are infinite, and the ratio of the coefficient of to that of g k ~ 2 is 


a k _ 2 (k — n — 2) 
a k - 2 k(k - 1) 


(5.20) 


Here k = 2, 4, 6, . . . for v e and k = 1, 3, 5, . . . for v a . For a fixed value of n, other 
than n = even for v e and n = odd for v 0 , all terms with k > n + 2 have the same 
sign. Furthermore, it is easy to verify that if k > C(n + 2), where C is an arbitrary 
constant C > 1, the ratio (5.20) is always greater than the corresponding ratio 
of coefficients in the power series expansion of exp[(l — 1 /C)£ 2 ] or 

g exp [(1 — 1/C)f 2 ], depending on whether k is even or odd. If C is chosen to be 
greater than 2, it follows from (5.15) that for large values of |£| the eigenfunction 
i/f(f) diverges faster than exp[(l/2 — 1/C)£ 2 ] or £ exp[(l/2 — 1/C)£ 2 ], respectively. 

Hence, unless n is an integer, </f(£) diverges as |£| — » Such wave functions 
are physically not useful, because they describe a situation in which it is overwhelm- 
ingly probable that the particle is not “here” but at infinity. This behavior can be 
avoided only if n is an integer, in which case one of the two series (5.19) terminates 
and becomes a polynomial of degree n. 
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If n is even, we get 

<K£) = ^ /2 vM) (5.21) 

and the state has even parity. If n is odd, we get 

<K£) - *- f2/2 vM) (5-22) 

and the state has odd parity. Both (5.21) and (5.22) are now finite everywhere and 
quadratically integrable. 

We have come to a very important conclusion: The number n must be a non- 
negative integer ( n — 0, 1,2,.. .); hence, E can assume only the discrete values 
(5.18) 


E„ = 


ho 



1 , 3 5 

or E — - nco, - run, - run , . . . 

2 2 2 


(5.23) 


Classically, all nonnegative numbers are allowed for the energy of a harmonic os- 
cillator. In quantum mechanics, a stationary state of the harmonic oscillator can have 
only one of a discrete set of allowed energies! The energies are thus indeed quan- 
tized, and we may speak of a spectrum of discrete energy levels for the harmonic 
oscillator. The equally spaced energy levels of the harmonic oscillator, familiar to 
all students of physics since the earliest days of quantum theory, are sometimes 
referred to as the steps or rungs of a ladder. 

The numbers E n are the eigenvalues of the Schrodinger equation 


ft 2 d 2 ip 
2m dx 2 


+ - m(o 2 x 2 ip = Ei{f 


(5.24) 


and of the Hamiltonian operator 



(5.25) 


The corresponding solutions are the eigenfunctions. The eigenfunctions (5.21) 
and (5.22) decay rapidly at large distances, so that the particle is confined to the 
neighborhood of the center of force. The states described by such eigenfunctions 
are said to be bound. Note that the eigenvalues of the linear harmonic oscillator are 
not degenerate, since for each eigenvalue there exists only one eigenfunction, apart 
from an arbitrary constant factor. This property of the one-dimensional Schrodinger 
equation follows from the discussion of Section 3.5 [case (b)]. That the eigenvalues 
are equidistant on the energy scale is one of the important peculiar features of the 
x 2 dependence of the oscillator potential. 

The nonnegative integers n are the eigenvalues of the operator 


He 


11,1 

— — - Pf - 1 — 
2 2 s 2 



(5.26) 


as defined in (5.7). Because its eigenvalues are n = 0, 1, 2, 3, ... , the operator 
— 1/2 is called the number operator. It measures how many steps of the energy 
level ladder of the harmonic oscillator have been excited. Depending on the physical 
context to which the harmonic oscillator formalism is applied, the state of excitation 
n is said to represent the presence of n quanta or particles. An alternative and very 
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general derivation of the eigenvalues and eigenfunctions (or better, eigenvectors or 
eigenstates) of the number operator, which makes use only of the algebraic prop- 
erties of the operators £ and p 6 and their commutator (5.1 1), will be found in Section 
10 . 6 . 

As a label of the eigenvalues and eigenfunctions of the ordinary linear harmonic 
oscillator, the number n is traditionally and blandly called a quantum number. Its 
minimum value, n — 0, corresponds to the ground state, but the energy of the os- 
cillator is still h(o/2 and does not vanish as the lowest possible classical energy 
would. Since in a stationary state, { H ) = E, a zero energy eigenvalue would imply 
that both (px) = 0 and (x 2 ) = 0. As in Section 4.1, it would follow that both 
p x il> = 0 and xi ft - 0. But no if/ exists that satisfies these two equations simulta- 
neously. The energy h(o! 2 is called the zero-point energy of the harmonic oscillator. 
Being proportional to h, it is obviously a quantum phenomenon; it can be understood 
on the basis of the uncertainty principle (Chapter 10). 


3. Study of the Eigenfunctions. In this section, a few important mathematical 
properties of the harmonic oscillator eigenfunctions will be derived. 

The finite polynomial solutions of (5.16) which can be constructed if n is an 
integer are known as Hermite polynomials of degree n. The complete eigenfunctions 
are of the form 


<Pn(x) = C n H n 




(5.27) 


where H n denotes a Hermite polynomial of degree n, and C„ is an as yet undeter- 
mined normalization constant. But first the Hermite polynomials themselves must 
be normalized. It is traditional to define them so that the highest power of £ appears 
with the coefficient 2 n . Hence, by comparing with (5.19), we see that for even n. 


H n {0 = (- 1)" /2 


n\ 

(n/2)! 


v e (i) 


(5.28) 


and for odd n, 


H (t) = (-l-fn-vn 

n{i) 1 ; [{n - l)/2] ! 

Here is a list of the first few Hermite polynomials: 


Ho(& = 1 

H 3 (0 = -12£ + 8£ 3 

ffi(f) = n 

H 4 (0 =12- 48£ 2 + 16£ 4 

H 2 (& = -2 + 4£ 2 

H s (€) = 120^ - 160^ 3 + 32^ 5 


(5.29) 


(5.30) 


They satisfy the differential equation 


d 2 H n (Q _ dHJf) 

de * d£ 


+ 2 nH n (0 = 0 


(5.31) 


The first few harmonic oscillator eigenfunctions are plotted in Figure 5.1 and 5.2. 


Exercise 5.4. Prove that 


dH n (f) 


2nH n -}(f) 


(5.32) 
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Figure 5.1. The energy eigenfunctions of the linear harmonic oscillator, for the quantum 
numbers n = 0 to 5. The functions if/„ are plotted versus g = S/nuo/h x and are normalized 
as /| iff n (g)\ 2 dg = 1. The vertical axis is the energy in units of ho. 


A particularly simple representation of the Hermite polynomials is obtained by 
constructing the generating function 




n= 0 n\ 


As a consequence of the relation (5.32), we see that 


dF(g, s) 

dg 


2s F(g, s ) 


(5.33) 
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Figure 5.2. The three lowest energy eigenfunctions (n - 0, 1, 2) for the linear harmonic 
oscillator in relation to the potential energy, V(x) = mo? x 2 /2 = hag 2 / 2. The intercepts of 
the parabola with the horizontal lines are the positions of the classical turning points. 


This differential equation can be integrated: 

F{t s) = F( 0, s) e 2s * 

The coefficient F( 0, s ) can be evaluated from (5.33) and (5.28): 


F(0, s ) 


CO 


2 





and therefore the generating function has the form 

Fit s) = e ~ s2+2s€ = (5.34) 

The generating function Fit s) is useful because it allows us to deduce a number 
of simple properties of the harmonic oscillator wave functions with ease. For ex- 
ample, by Taylor’s expansion of (5.34): 

^ = (£ ■ ( -‘ r ^ (5 - 35) 


a popular alternative form of definition of the Hermite polynomials. 

From this definition it follows that all n roots of H n it) must be real. 


Proof. Assume that 
tend to zero as £, — » 


jn-lg-S _ 2 

r~ has n— 1 real roots. Since e s and all its derivatives 

d?~ l 

d n e~ ^ 

±o°, the derivative — - must have at least n real roots. Being 

dij n 
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e ~ f2 times a polynomial of degree n, it can have no more than n such roots. The 
assumption holds for n — 1, whence the assertion follows by induction. 

The points in coordinate space at which a wave function goes through zero are 
called its nodes. Evidently, the oscillator eigenfunction (5.27) has n nodes. 

From the generating function we derive conveniently the value of the useful 
integral 


I nkp = f H n (f)H k (f)e~^^ d£ 


(5.36) 


for nonnegative integers n, k, p. To this end we construct the expression 


/: 


e £ 2 -(s-(f e £ 2 -(t-& 2 e 2\£-g 2 


°° ® „n f k r+oo 

d£= S X -777 H n (OH k (Oe 2XS ~ e2 d£ 
„=o *-o n\k\ J-® 


where (5.33) and (5.34) have been used. The left-hand side can be integrated ex- 
plicitly; it equals 

r + oo 

e 2st+K 2 +2\(s+t) I £ -(s+t+ A-£) 2 g A 2 +2(i<+As+A0 


Hence, 


oo oo 




p — 0 n — 0 k=0 


nkp 


s n t k (2A y 
n\ k\ p\ 


\f~Tr gS 2 +2(st+ \s+ \t) 


(5.37) 


Comparing the coefficients of equal powers of s n t k X p , we obtain the value of I nkp . 

In particular, for p = 0, we verify that the oscillator energy eigenfunctions for 
n i 1 k are orthogonal, as expected. For n = k, we obtain the integral 

Inno = | M [H n (f)fe~ e d£ = 2" nW/rr 


In terms of the variable £, the orthonormality of the eigenfunctions is expressed as 

— f H n (OH k (Oe-* 2 df = 8 nk (5.38) 

2" nlVir J-® 


If we recall that £ 



x, we have for the normalized eigenfunctions 



and the orthonormality relation 


j: 


f!(#tW = s nk 


(5.39) 


(5.40) 


For the sake of generality, complex conjugation appears in (5.40), although with the 
particular choice of the arbitrary phase factor embodied in (5.39) the eigenfunctions 
are real. 
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In many calculations, we will need to know the matrix elements of the operator 



iff* (x)x p if/ k (x) dx 


(5.41) 


the ‘ ‘bra-ket’ ’ on the left-hand side of this equation is introduced here merely as a 
notational shorthand. Its name and its significance will be explained in due course. 
For use in the next section, we record the value of (5.41) for p = 1: 


(n\x\k) 


f- 


tf/ n (x)xif/ k (x) dx = — 



(5.42) 


Exercise 5.5. From (5.36) and (5.37), work out the matrix elements (n|x|k) 
and (n\x 2 \k) for the harmonic oscillator. 

Integral representations of the special functions that we encounter in quantum 
mechanics are often very useful. For the Hermite polynomials the integral represen- 
tation 

H n (0 = ~y= [ (€+ iu) n e~ u2 du (5.43) 

V 77 J-°° 


is valid. 


Exercise 5.6. Validate (5.43) by verifying that it satisfies (5.32) and by check- 
ing the initial values H n { 0). Alternatively, show that (5.43) can be used to verify 
the formula (5.33) for the generating function of Hermite polynomials. 


If we let s — ri + iu in the generating function (5.33), multiply the equation 
by e~ u , and integrate over u from -oo to +°°, we obtain from (5.43) 

* + c 


*t oo 00 

F(g, j] + iu)e ~ u 2 du = Vtt X 

K=C 


H n (t)H n (v) 


n = o n\ 2” 

and by (5.34) this expression equals 

r+ oo 

e -( V +iu) 2 + 2£( V + iu)-u 2 du _ 7J -g-V 2 + 2^§(^ _ 


Consequently, 


1 


H n (Z)e-e 2,2 H n ( m)e-^ 12 = 8(£ - v ) 


„=o 2" n\V^r 

Translated into the x representation, this is the closure relation (4.42): 


(5.44) 


2 4>n (x')^n W = - *') (5.45) 

71 = 0 

This relation shows that the harmonic-oscillator eigenfunctions constitute a complete 
set of orthonormal functions in terms of which an arbitrary function of x can be 
expanded. The fundamental expansion postulate of quantum mechanics is thus 
shown to be valid for the energy eigenfunctions of the harmonic oscillator. 
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Exercise 5.7. In the generating function (5.33), replace s by s — t (17 + iu) 
and prove Mehler’s formula 


lo i^ = VT=7 


exp 


2gr, - f 2 (g 2 + V 2 ) 

1 - t 2 


(5.46) 


In this section we have derived a number of mathematical results pertaining to 
the solution of the Schrodinger equation for the linear harmonic oscillator. Although 
the physical significance of some of these formulas will become apparent only later, 
it seemed efficient to compile them here in one place. 

4. The Motion of Wave Packets. So far we have considered only the stationary 
states of the harmonic oscillator. We now turn our attention to the behavior of a 
general wave i//(x, t ) whose initial form i}/(x, 0) is given. The time-dependent Schro- 
dinger or wave equation 


d<K*, t) 

in 

dt 


h 2 d 2 iff(x, t) mco 


2m dx 2 


— x 2 ij/(x, t ) 


(5.47) 


determines the time development of the wave. In Chapters 3 and 4 we saw that the 
solution of this equation can be obtained automatically, if the initial wave can be 
expanded in terms of the time-dependent eigenfunctions of the corresponding (time- 
independent) Schrodinger equation. The completeness of the orthonormal energy 
eigenfunctions of the harmonic oscillator was proved in the last section. Accord- 
ingly, if 1 l/(x, 0), which we assume to be normalized to unity, is expanded as 


>P(X, 0) - X Cn'l'nto 

K ~0 

with the expansion coefficients given by 

/•+ 00 

C„ = I <A* (x)tjj(x, 0) dx 


(5.48) 


(5.49) 


then, knowing that for the harmonic oscillator, E n = + -J, we can construct 

the wave packet at time t by the use of (3.79): 

/ . \ ~ 

icot 


if/(x, t) = exp( I 2 c n l K to exp (-inwt) 

2 ) n—0 


(5.50) 


The center of probability of the normalized wave packet, i.e., the expectation value 
of the position operator x, is according to (3.15) 


/» 00 

(x) t = I x| 1 j/(x, t) | 2 dx 

Substituting (5.50) into (5.51), we find 

(x) t = S 2 c* n c k e i(n ~ k)0>t J ift(x)xip k (x) dx 


(5.51) 


n = 0 k = 0 

00 00 


(5.52) 


= S I ctc k e Kn k)o>t (n\x\k) 


n = 0 k = 0 
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which shows that the matrix elements of x, obtained in (5.42), enter critically. If 
(5.42) is substituted in (5.52), the selection rule for the matrix elements (rc|x|£), 

n - k = ±1 (5.53) 


simplifies the summations and gives the result 

(x), = J-L- £ + ci.tc.e- 1 -*) 

y 2,f7i(o n — i 

If we set 




(5.54) 


we can write 




cos(o)t + <f > n - 1 — 4> n ) 


(5.55) 


This expression is exact. It shows that the expectation value of the coordinate, < x ) t , 
oscillates harmonically with frequency a), just like the classic coordinate x(r). 


Exercise 5.8 . Use (5.32), (5.40), and (5.42) to calculate the matrix elements 
of the momentum operator, 


(n\p x \k) = 


. ft d Mx) , 

OTT -; — dx 
i dx 


With this result, evaluate ( p x ), as a function of time for the wave packet (5.50). 

Verify that {p x ) t = m — (x), for this wave packet. 

dt 


Exercise 5.9. Verify that (5.55) is expressible as 

(x), = (x) 0 cos (jit + sin cot 

mo 

which can also be derived directly from the equation of motion for (x) t (see Problem 
3 in Chapter 3). 


Problems 

1. Calculate the matrix elements of p x 2 with respect to the energy eigenfunctions of the 
harmonic oscillator and write down the first few rows and columns of the matrix. Can 
the same result be obtained directly by matrix algebra from a knowledge of the matrix 
elements of pj 

2. Calculate the expectation values of the potential and kinetic energies in any stationary 
state of the harmonic oscillator. Compare with the results of the virial theorem. 

3. Calculate the expectation value of x 4 for the n-th energy eigenstate of the harmonic 
oscillator. 

4. For the energy eigenstates with n = 0,1, and 2, compute the probability that the 
coordinate of a linear harmonic oscillator in its ground state has a value greater than 
the amplitude of a classical oscillator of the same energy. 
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5. Show that if an ensemble of linear harmonic oscillators is in thermal equilibrium, 
governed by the Boltzmann distribution, the probability per unit length of finding a 
particle with displacement of jc is a Gaussian distribution. Plot the width of the dis- 
tribution as a function of temperature. Check the results in the classical and the low- 
temperature limits. [Hint: Equation (5.43) may be used.] 


6. Use the generating function for the Hermite polynomials to obtain the energy eigen- 
function expansion of an initial wave function that has the same form as the oscillator 
ground state but that is centered at the coordinate a rather than the coordinate origin: 

N / ma)(x — a) 2 \ 

exp - 


0 ) 



2 h 


(a) For this initial wave function, calculate the probability P„ that the system is 
found to be in the n-th harmonic oscillator eigenstate, and check that the P n add up 
to unity. 

(b) Plot P n for three typical values of a, illustrating the case where a is less 

FT 

than, greater than, and equal to / . 


(c) If the particle moves in the field of the oscillator potential with angular 
frequency co centered at the coordinate origin, again using the generating function 
derive a closed-form expression for ip(x, t). 


(d) Calculate the probability density | tp(x, t ) | 2 and interpret the result. 



CHAPTER 6 

Sectionally Constant Potentials 
in One Dimension 


Potentials like the rectangular barrier or the square well, which are pieced 
together from constant zero-force sections with sharp discontinuities, do 
not occur in nature but serve as convenient models. Classically, they are 
trivial, but here they are useful to exemplify characteristic quantum 
properties that arise from the smooth joining of the Schrodinger wave 
function (Section 3.5) at the discontinuities of the potential, such as 
tunneling and scattering resonances. The mathematics is relatively simple, 
so that we can concentrate on the physical features, especially the power 
of symmetry considerations. 

1 . The Potential Step. Of all Schrodinger equations, the one for a constant po- 
tential is mathematically the simplest. We know from Chapter 2 that the solutions 
are harmonic plane waves, with wave number 

k = V2 m(E - V ) 

We resume study of the Schrodinger equation with such a potential because the 
qualitative features of a real physical potential can often be approximated reasonably 
well by a potential that is made up of a number of constant portions. For instance, 
unlike the electrostatic forces that hold an atom together, the strong nuclear forces 
acting between protons and neutrons have a short range; they extend to some dis- 
tance and then drop to zero very fast. Figure 6. 1 shows roughly how a rectangular 
potential well — commonly called a square well — might simulate the properties of 
such an interaction. 

Often, such a schematic potential approximates the real situation and provides 
a rough orientation with comparatively little mathematical work. As we will see in 
Section 8.7, a sectionally constant periodic potential exhibits some of the important 
features of any periodic potential seen by an electron in a crystal lattice. 

The case of the free particle, which sees a constant potential V(x) = const., for 
all x, in three dimensions as well as one dimension with and without periodic bound- 
ary conditions, was already discussed in Section 4.4. 

Next in order of increasing complexity is the potential step V(x) = V 0 iq(x) as 
shown in Figure 6.2. There is no physically acceptable solution for E < 0 because 
of the general theorem that E can never be less than the absolute minimum of V(x). 
Classically, this is obvious. But as the examples of the harmonic oscillator and the 
free particle have already shown us, it is also true in quantum mechanics despite 
the possibility of penetration into classically inaccessible regions. We can prove the 
theorem by considering the real solutions of Schrodinger’ s equation (see Exercise 
3.20): 

~ >P"(x) + [V(x) - E]ip(x) = 0 
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Figure 6.1. Potential approximating the attractive part of nuclear forces 
(V = — e~ w t\x\) and a one-dimensional square well simulating it. 


If V{x ) > E for all x, ip" has the same sign as ip, everywhere. Hence, if ip is positive 
at some point x, the wave function has one of the two convex shapes shown in Figure 
6.3, depending on whether the slope is positive or negative. In Figure 6.3a, ip can 
never bend down to be finite as x — > +°°. In Figure 6.3 b, ip diverges as * — » — °°. 
To avoid these catastrophes, there must always be some region where E > V(x ) and 
where the particle can be found classically. 


Exercise 6.1. Prove that E must exceed the absolute minimum value of the 
potential V(x) by noting that E = (H) in the stationary state ip E (x). 

Now we consider the potential step with 0 < E < y 0 . Classically, a particle of 
this energy, if it were incident from the left, would move freely until reflected at 
the potential step. Conservation of energy requires it to turn around, changing the 
sign of its momentum. 





Figure 6.2. Energy eigenfunction for the (Heaviside) step potential function 

V(jc) = V 0 corresponding to an energy E = V 0 /2. The step function jj(x) is defined in 

Section 1 of the Appendix. The normalization is arbitrary. 
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Figure 6.3. Convex shape of the wave function in the nonclassical region (< j/'lty > 0). 


The Schrodinger equation has the solution 


<M*) = 


Ue ikx + Be~ ikx 
[Ce~ KX 


(x < 0) 
(x > 0) 


Here 

hk — V2mE hK = V2m(V 0 “ E) 


( 6 . 1 ) 


Since iJj(x) and its derivative </d(x) approach zero as x + °°, according to case (b) 
in Section 3.5 there is no degeneracy for E < V 0 . The second linearly independent 
solution for x > 0, e KX , is in conflict with the boundary condition that i fi(x) remain 
finite as x — > + o° . 

By joining the wave function and its slope smoothly at the discontinuity of the 
potential, * = 0, we have 

A + B = C 
ik(A ~ B) = —kC 


or 


B 

A 

C 

A 


ik + k 
ik — k 
2 ik 

ik — k 


= e 


ia 


= 1 + e ia 


(a: real) 


( 6 . 2 ) 


Substituting these values into (6.1), wc obtain 


ijj(x) = ■{ 


2 Ae ia/2 cos^fcc — — j 

(x < 0) 

2 Ae ia/2 cos ^ 

2 

(x > 0) 


(6.3) 


in agreement with the remark made in Section 3.5 that the wave function in the case 
of no degeneracy is real, except for an arbitrary constant factor. Hence, a graph of 
such a wave function may be drawn (Figure 6.2). The classical turning point (jc = 0) 
is a point of inflection of the wave function. The oscillatory and exponential portions 
can be joined smoothly at x — 0 for all values of E between 0 and the energy 
spectrum is continuous. 

The solution (6.1) can be given a straightforward interpretation. It represents a 
plane wave incident from the left with an amplitude A and a reflected wave that 
propagates toward the left with an amplitude B. According to (6.2), |A| 2 — | Bp; 
hence, the reflection is total. A wave packet which is a superposition of eigenfunc- 
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tions (6.1) could be constructed to represent a particle incident from the left. This 
packet would move classically, being reflected at the wall and again giving a van- 
ishing probability of finding the particle in the region of positive x after the wave 
packet has receded; there is no permanent penetration. 

Perhaps these remarks can be better understood if we observe that for one- 
dimensional motion the conservation of probability leads to particularly transparent 
consequences. For a stationary state, Eq. (3.3) reduces to djldx — 0. Hence, the 
current density 


h 

2m i 



dip* 

dx 




(6.4) 


has the same value at all points x. When calculated with the wave functions (6.3), 
the current density j is seen to vanish, as it does for any essentially real wave func- 
tion. Hence, there is no net current anywhere at all. To the left of the potential step, 
the relation \A | 2 = \B | 2 ensures that incident and reflected probability currents cancel 
one another. If there is no current, there is no net momentum in the state (6.1). 


Exercise 6.2. Show that for a wave function ip(x) — Ae lkx + Be~ lkx , the current 
density j can be expressed as the sum of an incident and a reflected current density, 
j ~ jinc + jre f> without any interference terms between incident and reflected waves. 


The case of an infinitely high potential barrier (V 0 — » °° or k — > °o) deserves 
special attention. From (6.1) it follows that in this limiting case ip(x) — » 0 in the 
region under the barrier, no matter what value the coefficient C may have. According 
to (6.2), the joining conditions for the wave function at x = 0 now reduce formally 
to 


B 

lim - = - 1 

K— >oo A 



or A + B = 0 and C — 0 as V 0 — > °o . These equations show that at a point where 
the potential makes an infinite jump the wave function must vanish, whereas its slope 
jumps discontinuously from a finite value (2 ikA) to zero. 

We next examine the quantum mechanics of a particle that encounters the po- 
tential step in one dimension with an energy E > V 0 . Classically, this particle passes 
the potential step with altered velocity but no change of direction. The particle could 
be incident from the right or from the left. The solutions of the Schrodinger equation 
are now oscillatory in both regions; hence, to each value of the energy correspond 
two linearly independent, degenerate eigenfunctions, as discussed in case (a) in Sec- 
tion 3.5. For the physical interpretation, their explicit construction is best accom- 
plished by specializing the general solution: 


ip{x) = 


Ae ikx + Be~ ikx 
Ce ik ' x + De~ iklX 


(x < 0) 
(jc > 0) 


(6.5) 


where 

hk — V2 mE and hk x — \/2m(E — V 0 ) 


Two useful particular solutions are obtained by setting D - 0, or A — 0. The first 
of these represents a wave incident from the left. Reflection occurs at the potential 
step, but there is also transmission to the right. The second particular solution rep- 
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resents incidence from the right, transmission to the left, and reflection toward the 
right. 

Here we consider only the first case ( D — 0). The remaining constants are 
related by the condition for smooth joining at x = 0, 


from which we solve 


A + B = C 
k(A - B) = k x C 


B k — k x . C 2k 

— = and — = — 

A k + fcj A k + k x 


( 6 . 6 ) 


The current density is again constant, but its value is no longer zero. Instead, 

hk 
m 
fik x 
m 


J 


(UP 


B | 2 ) 


= < 


(. x < 0 ) 
(. x > 0 ) 


in agreement with Exercise 6.2. The equality of these values is assured by (6.6) and 
leads to the relation 



A | 2 


, *il£l! = 1 

k lAl 2 


(6.7) 


In analogy to optics, the first term in this sum is called the reflection coefficient, R, 
and the second is the transmission coefficient, T. We have 


A-f 

B 

A 

2 (k - k x f 
z (k + kff 


T kl 

\c 

'\ z 4 kki 

k 

U 

I 2 (k + fci) 2 . 


( 6 - 8 ) 

(6.9) 


Equation (6.7) ensures that R + T = 1. The coefficients R and T depend only on 
the ratio E/V 0 . 

For a wave packet incident from the left, the presence of reflection means that 
the wave packet may, when it arrives at the potential step, split into two parts, 
provided that its average energy is close to V 0 . This splitting up of the wave packet 
is a distinctly nonclassical effect that affords an argument against the early attempts 
to interpret the wave function as measuring the matter (or charge) density of a par- 
ticle. For the splitting up of the wave packet would then imply a physical breakup 
of the particle, and this would be very difficult to reconcile with the facts of obser- 
vation. After all, electrons and other particles are always found as complete entities 
with the same distinct properties. On the other hand, there is no contradiction be- 
tween the splitting up of a wave packet and the probability interpretation of the wave 
function. 


Exercise 6.3. Show that, for a given energy E, the coefficients for reflection 
and transmission at a potential step are the same for a wave incident from the right 
as for a wave incident from the left. Note that the relative phase of the reflected to 
the incident amplitude is zero for reflection from a rising potential step, but tt for 
reflection from a sharp potential drop. 
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Figure 6.4. Rectangular potential barrier, height V 0 , width 2a. 


2. The Rectangular Potential Barrier. In our study of more and more compli- 
cated potential forms, we now reach a very important case, the rectangular potential 
barrier (Figure 6.4). There is an advantage in placing the coordinate origin at the 
center of the barrier so that V(jc) is an even function of x. Owing to the quantum 
mechanical penetration of a barrier, a case of great interest is that of E < V 0 - The 
particle is free for x < —a and x > a. For this reason the rectangular potential 
barrier simulates, albeit schematically, the scattering of a free particle from any 
potential. 

We can immediately write down the general solution of the Schrodinger equa- 
tion for E < V Q : 


'Ae ikx + Be~ ikx 

(x < —a) 


Ce - ** + De ** 

(—a < x < a) 

(6.10) 

Fe ikx + Ge~ ikx 

(a < x) 



where again hk = V2 mE, Hk = V2m(V 0 — £)• The boundary conditions atx = —a 
require 


Ae~ ika + Be ika 
Ae ~ ika - Be ika 


= Ce Ka + De Ka 

= v ( Ce Ka - De ~ Ka ) 
k 


( 6 . 11 ) 


These linear homogeneous relations between the coefficients, A, B, C, D are con- 
veniently expressed in terms of matrices: 




\ s / 


[R 


V 


1 - 


IK 


_ Ka + ika 


n Ka—ika 


1 - — \ e - Ka+ika 


1 + — } e -«a-ika 




\ D ! 


The joining conditions at x — a are similar. They yield 


/ 


C 

D 


A 


^ \^Ka+ika 

x) 

1 + l ^\ e - Ka+ika 


1 + -j e Ka ~ ika 
1 - l -\ e ~ Ka ~ ika 




*/ 


/ 


\ G / 
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Combining the last two equations, we obtain the relation between the wave function 
on both sides of the barrier: 



cosh 2 Kfl + — sinh 2 Ka \e 2lka 
2 ) 

ir\ • , „ 

— — sznh 2 Ka 
2 


17} . , „ 

— sxnh 2ko 
2 


cosh 2ko 



sinh 2ko \e 


where the abbreviated notation 


k k 
e ~ k ~ k ’ 


k k 

i? = T + “ 

K K 


( 6 . 12 ) 


(6.13) 


has been used. Note that rj 2 ~ e 2 = 4. 


Exercise 6.4. Calculate the determinant of the 2 X 2 matrix in (6.12). 

A particular solution of interest is obtained from (6.12) by letting G = 0. This 
represents a wave incident from the left and transmitted through the barrier to the 
right. A reflected wave whose amplitude is B is also present. We calculate easily: 

P ~2ika 

- (6.14) 

A cosh 2 Ka + z(s/2) sinh 2 Ka 

The square of the absolute value of this quantity is the transmission coefficient for 
the barrier. It assumes an especially simple form for a high and wide barrier, which 
transmits poorly, such that Ka » 1. In first approximation, 

cosh 2 Ka ~ sinh 2 Ka e 2Ka /2 


Hence, 


k 2 + k 2 


(6.15) 


Another limiting case is that of a very narrow but high barrier such that V 0 » E, 
k » k, and Ka « 1, but V Q a or K 2 a is finite. Under these conditions, 

F 2 k 2 E 

T = A Ic 2 + k 4 o 2 = ~ 2m' (6 ' 16) 

E + W 


(6.16) 


E + TT V° a 


If the “area” under the potential is denoted by 

g = 2 lim F 0 a 

a— >0 

V(jr-»“> 


(6.17) 


the potential may be represented by a delta function positioned at the origin, 

V(x) = gS(x) (6.18) 

and the transmission through this potential barrier is 


(6.19) 
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The matrix that connects A and B with F and G in (6.12) has very simple 
properties. If we write the linear relations as 


/A\ = /«i + ip { a 2 + z'/6 2 \/>\ 
\.B/ ^<*3 + ifi 3 (X 4 + ip4/\G/ 


( 6 . 20 ) 


and compare this with (6.12), we observe that the eight real numbers a t and p t in 
the matrix satisfy the conditions 


= « 4 , Pi = ~p4, a 2 = a 3 = 0 , p 2 = -p 3 ( 6 . 21 ) 


These five equations reduce the number of independent variables on which the matrix 
depends from eight to three. As can be seen from (6.12) and Exercise 6.4, we must 
add to this an equation expressing the fact that the determinant of the matrix is equal 
to unity. Using (6.21), this condition reduces to 


«? + P\ ~ Pi = 1 


( 6 . 22 ) 


Hence, we are left with two parameters, as we must be, since the matrix depends 
explicitly on the two independent variables ka and Ka. 


Exercise 6.5. If the matrix elements are constrained by (6.21) and (6.22), show 
that (6.20) can be written as 


/ e lv cosh A i sinh A \ / F 
\—i sinh A e~ ,v cosh A / \G 


(6.23) 


where A and v are two real parameters. For the delta-function barrier (6.18), identify 
A and v in terms of g and k. More generally, verify that (6.12) has the form (6.23). 

In the next section it will be shown that the conditions (6.21) and (6.22) imposed 
on (6.20), rather than pertaining specifically to the rectangular-shaped potential, are 
consequences of very general symmetry properties of the physical system at hand. 


3. Symmetries and Invariance Properties. Since the rectangular barrier of Figure 
6.4 is a real potential and symmetric about the origin, the Schrodinger equation is 
invariant under time reversal and space reflection. We can exploit these properties 
to derive the general form of the matrix linking the incident with the transmitted 
wave. 

We recapitulate the form of the general solution of the Schrodinger equation: 


'Ae ikx + Be~ ikx 

(x < —a) 


Ce~ KX + De KX 

(—a < x < a) 

(6.10) 

Fe ikx + Ge~ ikx 

{a < jc) 



The smooth joining conditions at x = —a and x = a lead to two linear homogeneous 
relations between the coefficients A, B, F, and G, but we want to see how far we 
can proceed without using the joining conditions explicitly. If we regard the wave 
function on one side of the barrier, say for x > a, as given, then the coefficients A 
and B must be expressible as linear homogeneous functions of F and G. Hence, a 
matrix M exists such that 

(A\ = / Mu M 12 \/F\ 

\BJ \M 21 M„J[g) 


(6.24) 
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An equivalent representation expresses the coefficients B and F of the outgoing 
waves in terms of the coefficients A and G of the incoming waves by the matrix 
relation 



(6.25) 


Whereas the representation in terms of the S matrix is more readily generalized to 
three-dimensional situations, the M matrix is more appropriate in one-dimensional 
problems. On the other hand, the symmetry properties are best formulated in terms 
of the S matrix. 

The S and M matrices can be simply related if conservation of probability is 
invoked. As was shown in Section 6.1, in a one-dimensional stationary state, the 
probability current density j must be independent of x. Applying expression (6.4) to 
the wave function (6.10), we obtain the condition 

|A| 2 — \B\ 2 = |F| 2 — |G| 2 or \B\ 2 + \F\ 2 = \A\ 2 + \G\ 2 

as expected, since \A\ 2 and \F \ 2 measures the probability flow to the right, while 
\B\ 2 and \ G\ 2 measure the flow in the opposite direction. Using matrix notation, we 
can write this as 


(B* F*)(^j = 

where S denotes the transpose matrix of S, and S* the complex conjugate. It follows 
that S must obey the condition 


(A* G*)S*S 


= (A* G*) 



(6.26) 


Since the Hermitian conjugate of the matrix S is defined by 



(6.27) 


Equation (6.26) implies the statement that the inverse of S must be the same as its 
Hermitian conjugate. Such a matrix is said to be unitary. 

For a 2 X 2 matrix S, the unitarity condition (6.26) implies the following con- 
straints: 


|Sn! = |S 22 | and |S I2 | = |S 21 | (6.28) 

I Sn| 2 + |S 12 | 2 - 1 (6.29) 

and 

SiiSt 2 + 5 21 S| 2 = 0 (6.30) 


Exercise 6.6. Verify that the conditions (6.28), (6.29), and (6.30) follow from 
(6.26). 


Since the potential is real, the Schrodinger equation has, according to Section 
3.5, in addition to (6.10), the time-reversed solution, 


'A*e~ ikx r + B*e ikx 
. c*e~ KX + Lrie** 
F*e~ ikx + G*e ikx 


(x < —a) 

(— a < x < a) 
(a < x ) 


* M*) = >ff*(x) = 


(6.31) 
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Comparison of this solution with (6.10) shows that effectively the directions of 
motion have been reversed and the coefficient A has been interchanged with B*, and 
F with G*. Hence, in (6.25) we may make the replacements A *-> B* and F <-» G* 
and obtain an equally valid equation: 


(6.32) 




Equations (6.32) and (6.25) can be combined to yield the condition 

5*5 = 1 


(6.33) 


This condition in conjunction with the unitarity relation (6.26) implies that the 5 
matrix must be symmetric as a consequence of time reversal symmetry: 


5-5 


(6.34) 


If 5 is unitary and symmetric, it is easy to verify by comparing Eqs. (6.24) and 
(6.25) that the M matrix assumes the form: 


AT = 


subject to the condition: 


det M = 


/j_ 

o* \ 
^11 

5l2 

oH= 

^12 

Su 

1 

(Sl2 

S* l2 / 

1 - 1 

5nl 2 


(6.35) 


= 1 


'12 I 


Since the potential is an even function of x, another solution is obtained by 
replacing x in (6.10) by — x. This substitution gives 

(Ae~ ikx + Be ikx (x > a) 

Ce KX + De~ KX (a > x > -a) (6.36) 

Fe~ ikx + Ge ikx (-a > x) 


<Mx) = i K~x) = 1 


Now, Ge ,kx is a wave incident on the barrier from the left, Be lkx is the corresponding 
transmitted wave, and Fe~' kx is the reflected wave. The wave Ae~ lkx is incident from 
the right. Hence, in (6.25) we may make the replacements A G and B F 
and obtain 



For comparison with (6.25) this relation can also be written as 

/ S 2 2 S 2 i 
\5i2 5 U/ 

Hence, invariance under reflection implies the relations 

5ji S 2 2 5j2 52i 



(6.37) 


If conservation of probability, time reversal invariance, and invariance under space 
reflection are simultaneously demanded, the matrix M has the structure 


M n = 2 , M, 2 = —Mt 2 = — M 2 1 = Ml i, det M = 1 


(6.38) 
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Exercise 6.7. Verify the relations (6.38). Check that they are satisfied by the 
M matrices in Eqs. (6.12) and (6.23) and that they are equivalent to conditions (6.21) 
and (6.22). 


We thus see that the conditions (6.21) and (6.22) can be derived from very 
general properties without knowledge of the detailed shape of the potential. These 
general properties are shared by all real potentials that are symmetric with respect 
to the origin and vanish for large values of |jc|. For all such potentials the solution 
of the Schrddinger equation must be asymptotically of the form 




Ae ikx + Be~ ikx 
Fe ikx + Ge~ ikx 


(*-> -°°) 
(x -» + 00 ) 


By virtue of the general arguments just advanced, these two portions of the eigen- 
functions are related by the equation 


( A \ = ( a i + i Pi *02 \/>\ 

\b) \ -ijs 2 oty - ipJ\Gj 


(6.39) 


with real parameters a l5 /3 X , and /3 2 subject to the additional constraint 

a\ + /3? - J8i = 1 (6.22) 

The same concepts can be generalized to include long-range forces. All that is 
needed to define a matrix M with the properties (6.39) and (6.22) is that the Schro- 
dinger equation admit two linearly independent fundamental solutions that have the 
asymptotic property 


«M“*) = < h(x) = <AT(x) (x-* ±oo) 

For a real even potential function V(x), this can always be accomplished by choosing 

(X) = l/fevenW + ty, odd(*)> = t/'evenC*) ~ *<Aodd(*) 

where i // even and if/ odd are the real-valued even and odd parity solutions defined in 
Section 5.2. 

Although the restrictions that various symmetries impose on the S or M matrix 
usually complement each other, they are sometimes redundant. For instance, in the 
simple one-dimensional problem treated in this section, invariance under reflection, 
if applicable, guarantees that the S matrix is symmetric [see the second Eq, (6.37)], 
thus yielding a condition that is equally prescribed by invariance under time reversal 
together with probability conservation. 

It should also be noted that the principle of invariance under time reversal is 
related to probability conservation and therefore to unitarity. If no velocity- 
dependent interactions are present, so that V is merely a function of position, the 
reality of V ensures invariance under time reversal and implies conservation of prob- 
ability. Velocity-dependent interactions, as they occur for instance in the presence 
of magnetic fields, can break time reversal symmetry without violating conservation 
of probability (unitarity). 

The matrix method of this section allows a neat separation between the initial 
conditions for a particular problem and the matrices S and M, which do not depend 
on the structure of the initial wave packet. The matrices S and M depend only on 
the nature of the dynamical system, the forces, and the energy. Once either one of 
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these matrices has been worked out as a function of energy, all problems relating to 
the potential barrier have essentially been solved. For example, the transmission 
coefficient T is given by |F| 2 /|A| 2 if G = 0, and therefore 


T = 


\M U 


= 5 - 


21 


(6.40) 


Exercise 6.8. If V = 0 for all * (free particle), show that 


M = 




and 



(6.41) 


If for real-valued V ¥= 0, the departure of the S matrix from the form (6.41) is 
measured by two complex-valued functions of the energy, r and t, which are defined 
by 5 n = 2 ir and S 2 i = 1 + lit, prove the relation 

\r\ 2 + |r| 2 = Imr (6.42) 

The analogue of this identity in three dimensions is known as an optical theorem 
(Section 20.6). 


We will encounter other uses of the M and S matrices in the next section and 
the next chapter. Eventually, in Chapter 20, we will see that similar methods are 
pertinent in the general theory of collisions, where the S or scattering matrix plays 
a central role. The work of this section is S-matrix theory in its most elementary 
form. 


Exercise 6.9. Noting that the wave number k appears in the Schrodinger equa- 
tion only quadratically, prove that, as a function of k, the S matrix has the property 

S(k) S(-k) = 1 (6.43) 

Derive the corresponding properties of the matrix M, and verify them for the example 
of Eq. (6.12). 


Exercise 6.10. Using conservation of probability and invariance under time 
reversal only, prove that at a fixed energy the value of the transmission coefficient 
is independent of the direction of incidence. (See also Exercise 6.3.) 


4 . The Square Well. Finally, we must discuss the so-called square (or rectan- 
gular) well (Figure 6.5). It is convenient to place the origin of the x axis in the center 
of the potential well so that F(x) is again an even function of x: 


V(x) = 



for 

for 


—a < x < a 


x > a 


(V 0 > 0) 


Depending on whether the energy is positive or negative, we distinguish two 
separate cases. If E > 0, the particle is unconfined and is scattered by the potential; 
if E < 0, it is confined and in a bound state. Assuming - V 0 < E < 0, we treat this 
last case first and set 


hk' = y/lm(E + V 0 ), fiK = V~2mE 


(6.44) 
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i l>(x) 



Figure 6.5. Square well potential of width 2 a and depth V 0 - For the choice of the 
parameter /3 = 3 there are two bound states. In units of h 2 /2ma 2 , the well depth is V 0 = 9, 
and the energies for the two discrete states are E 0 = —7.63 and E x = —3.81. The ground 
and excited state eigenfunctions are equally normalized, but the scale is arbitrary. 


The Schrodinger equation takes the form 


d 2 \jj(x) 
dx 2 

d 2 if/(x) 

dx 2 


+ k' 2 i f/(x) = 0 


- k 2 if/ (x) = 0 


inside the well 
outside the well 


(E< 0) 


(6.45) 


As for any even potential, we may restrict the search for eigenfunctions to those of 
definite parity. 

Inside the well we have 


if/(x) = A' cos k'x for even parity 

if/(x) = B' sin k'x for odd parity 

Outside the well we have only the decreasing exponential 

iff(x) = CV-"I*I 

since the wave function must not become infinite at large distances. 

It is necessary to join the wave function and its first derivative smoothly at 
x — a, that is, to require, 


(6.46) 

(6.47) 


lim ij/(a — s) = lim if/(a + s) 

e — *0 e — ->0 

lim if/' (p — s) = lim if/'(a + s) 

e — >0 e — *0 
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Since an overall constant factor remains arbitrary until determined by normalization, 
these two conditions are equivalent to demanding that the logarithmic derivative 
of iff. 


1 dip(x) _ d log if/(x) 
ift(x) dx dx 


(6.48) 


be continuous at x = a. This is a very common way of phrasing the smooth joining 
conditions. Because of the reflection symmetry, the smooth joining conditions are 
automatically satisfied at x = —a for both even and odd eigenfunctions. 

The logarithmic derivative of the outside wave function, evaluated at x = a, is 
— k; that of the inside wave functions is —k' tan k'a for the even case and k' cot k'a 
for the odd case. 

The transcendental equations 

I 

k' tan k'a = k (even) 

(6.49) 

k' cot k'a — k (odd) 


permit us to determine the allowed eigenvalues of the energy E. 

The general symmetry considerations of Section 6.3 can also be extended to 
the solutions of the Schrodinger equation with negative values of k 2 and E. In (6.10) 
we need only replace k by Ik and k by ik' . The solution then takes the form 


<AM = 


'Ae*** + Be KX 

(x < —a) 


Ce~ ik ' x + De ik ' x 

(—a < x < a) 

(6.50) 

Fe~ KX + Ge KX 

(a < x) 



By requiring invariance under time reversal and imposing the principle of conser- 
vation of probability, we see that the matrix M, defined as in (6.24), must now be a 
real matrix with det M = 1 . The boundary conditions at large distances require that 
A ~ G = 0. In terms of the matrix M, we must thus demand that 


M n = 0 (6.51) 

and this equation yields the energy eigenvalues. Defining again an S matrix as in 
(6.25), it follows that the bound-state energies are poles of the S matrix. 


Exercise 6.11. Show that for the square well, reflection symmetry implies that 
the off-diagonal matrix elements of M are ±1, giving us the even and odd solutions, 
respectively. 

Exercise 6.12. By changing V 0 into — V 0 in (6.12), show that, for a square 
well, (6.51) is equivalent to the eigenvalue conditions (6.49). 


A simple graphical method aids in visualizing the roots of (6.49). We set 

X = (k'a) 2 , (3 = J 2m ^ a . (xa) 2 + x = P 2 

In Figure 6.6 we plot F(x ) - P 2 ~ X and 
f evcn(x) ~ X ta n 2 Vx (if tan V^sO) f Q dd(x) = X cot 2 Vx (if tan < 0) 
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as functions of the positive independent variable x ■ F° r the square well, the only 
pertinent parameter is the value of the dimensionless quantity p. The required roots 
are found by determining the intercepts of the straight line F(^) with the curves / even 
and / odd . The ordinates (ka) 2 of the intersection points are the scaled values of the 
bound-state energies. 

By inspection of Figure 6.6, we can immediately draw several conclusions: All 
bound states of the well are nondegenerate; even and odd solutions alternate as 
the energy increases; the number of bound states is finite and equal to N + 1, if 
Ntt < 2/3 < (N + l)7r; if the bound states are labeled in order of increasing energy 
by a quantum number n = 0, 1, ... N, even values of n correspond to even parity, 
odd values of n correspond to odd parity, and n denotes the number of nodes; for 
any one-dimensional square well there is always at least one even state, but there 
can be no odd states unless p > 7t/ 2; and, the level spacing increases with in- 
creasing n. 

As V 0 is allowed to increase beyond all bounds, two special cases merit dis- 
cussion: 

(a) Here we let V 0 — » 00 while keeping the width of the square well finite, 
so that j8 oo. For this infinitely deep potential well, the roots of the equations 



Figure 6.6. Graphic determination of the energy levels in a square well with p 2 = 30. The 
curves for tan V* 5: 0 (f even ) alternate with those for tan V* ^ 0 (f odd ). The ordinates of 
the intercepts are the binding energies in units of h 2 /2ma 2 . The dashed-line asymptotes 
intersect the abscissa at the energy eigenvalues (6.52), again in units h 2 H ma 2 , for a 
particle confined to the box -a ^ x £ a. This figure may be used as a template for 
estimating the bound-state energies for any one-dimensional square well: merely draw a 
parallel to the diagonal straight line F(\) — P 2 ~ X f° r the desired value of p. 
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expressing the boundary conditions are now simply the asymptotes in Figure 6.6, 
or 

V* = (n + 1) 7t/2 (n = 0, 1, 2, . . .) 


Hence, 


(6.52) 


The left-hand side of this expression, E + V 0 , is the distance in energy from the 
bottom of the well and represents the kinetic energy of the particle in the well. Since 
E _ oo as V 0 — > oo , it follows that k — > + °° ; therefore, the wave function itself 
must vanish outside the well and at the endpoints x = ±a. There is in this limit no 
condition involving the slope, which for an infinite potential jump can be discontin- 
uous. Taking into account a shift V Q of the zero of energy and making the identifi- 
cation 2 a = L, we see that the energy levels (6.52) for odd values of n coincide with 
the energy spectrum for a free particle whose wave function is subject to periodic 
boundary conditions (see Section 4.4). Note that the number of states is essentially 
the same in either case, since there is double degeneracy in (4.59) for all but the 
lowest level, whereas (6.52) has no degeneracy, but between any two levels (4.59) 
there lies one given by (6.52) corresponding to even values of n. There is, however, 
no eigenstate of the infinitely deep well at E + F 0 — 0 [corresponding to n — 0 in 
(4.59)], because the corresponding eigenfunction vanishes. 

(b) Another interesting special case arises if V 0 tends to infinity as a tends to 
zero, but in such a way that the product V 0 a remains finite. As in (6.17), we denote 
the area under the potential by g = lim(2V r 0 ^)» but instead of being a repulsive barrier 
the potential is now the attractive delta function well, 

V(x) = -g8(x) \ (6.53) 

In this limit k' °°, but k'a — » 0, and k ,2 a — > 2 mV Q a/h 2 and remains finite. There 
are no odd solutions of (6.49) in this case, but there is one even root given by 

k' 2 a = k 


7T 2 h 2 
&ma 2 


E + V 0 — (n + iy 


or 


mg 2 
2 h 2 


(6.54) 


Thus, the attractive one-dimensional delta function well supports only one bound 
state. 

This conclusion can be verified directly from the Schrodinger equation, 


h d 2 ijj(x) 
2m dx 2 


- g8(x ) = E4i(x) 


(6.55) 


if we integrate this equation from x = — s to x — +e and then take the limit s — > 0. 
The result is 

lim t/f'(x)|^ + ^ <K 0) - 0 

B — ^0 fl 


(6.56) 
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From (6.47) we see that </>( 0) = C' and 

lim fix) = —2C'k 


E — >0 


Inserting these values in (6.56), we obtain 


K — 


mg 

h 2 


( 6 . 57 ) 


which is equivalent to the energy equation (6.54). 


Exercise 6.13. Use Eq. (6.56) to derive the transmission coefficient T for the 
delta-function potential as a function of energy, for E > 0. Compare with (6.19). 

Exercise 6.14. Show that the energy eigenvalue equations (6.49) can be cast 
in the alternate form 


cos| k'a ~ ny- J = for n ~ < k'a < (n + 1) y- 


(6.58) 


Devise a simple graphical method for obtaining the roots of (6.58). If /3 = 20, 
compute approximate values for the bound-state energy levels in units of h 2 /ma 2 . 


To conclude this chapter we discuss briefly what happens to a particle incident 
from a great distance when it is scattered by a square well. Here E > 0. Actually, 
this problem has already been solved. We may carry over the results for th e potential 
barrier, replacing V 0 by — V 0 and k by ik' , where hk’ — V2 m(E + V 0 ). Equation 
(6.12) becomes 


'A 

kB, 


where 


^ cos 2k' a — — sin 2 k'a\e 2,ka 

k 2 ) 

~ sin 2 k’a 


— sin 2 k'a 
2 

is' 


\ 


cos 2 k'a + — sin 2k' a je 2lka 
. 2 / 


, k’ k 
G k + Jk' ’ 


k' 


V = 


V 


0 

(6.59) 


(6.60) 


Equation (6.59) defines the matrix M for the square well if the energy is positive. 

The transmission coefficient T is obtained from (6.59) by choosing stationary 
states with G — 0 (no wave incident from the right), and writing 


F 

A 


1 


M n 


— Si 2 s/Te 


~it)> 


(6.61) 


This expression defines a phase shift (f> between the transmitted and the incident 
wave. From the properties of the S matrix in Section 6.3, or directly from (6.59), 
we find for the relation between the reflected and the incident wave: 

B Mo 


- = — = S n = Vl - Te~ i(4>±7r/2) 
A M n 


(6.62) 
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E_ 
Vo 

Figure 6.7. Transmission coefficient T versus E/V 0 for a square well with 137 t/ 4. The 
spikes on the left are at the positions of the seven discrete bound-state energy levels. 


For the square well, 



1 

e' 2 

cos 2 2 k'a sin 2 2k' a 

4 


(6.63) 


As E °°, s' — *> 2, and T 1, as expected. As a function of energy, the transmission 
coefficient rises from zero, fluctuates between maxima ( T = 1) at 2k' a — mr and 

7 T 

minima near 2 k'a = (2 n + 1) — , and approaches the classical value T — 1 at the 
higher energies. Figures 6.7 and 6.8 show this behavior for two different values of 



Figure 6.8. Transmission coefficient T versus EIV 0 for a deep square well with /3 = 315. 
As E increases, the resonances become broader. 
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/3. The maxima occur when the distance 4 a that a particle covers in traversing the 
well and back equals an integral number of de Broglie wavelengths, so that the 
incident wave and the waves that are reflected inside the well are in phase reinforcing 
each other. If the well is deep and the energy E low (/3 and s' » 1), the peaks 
stand out sharply between comparatively flat minima (see Figure 6.8). When the 
peaks in the transmission curve are pronounced they are said to represent 
resonances. 

The phase shift 4> can also be calculated from (6.59). We find 


(f) = 2 ka — arctan 



(6.64) 


Figure 6.9 portrays the energy derivative of the phase shift, — in units of V 0 , as a 

dE 


function of energy, for the same square well as in Figure 6.8. The resonances show 
up as pronounced points of inflection in the function (f>(E). 1 


Exercise 6.15. Show that the expressions (6.61) and (6.62) for the elements 
of the S matrix follow from the general properties derived in Section 6.3. From the 
matrix M for the square well, derive the expression (6.64) for the phase shift. For a 
square well with /3 — 315, as in Figures 6.8 and 6.9, compute numerically and sketch 
graphically the energy dependence of the phase shift (f) ( E ) for E > 0 in the resonance 
domain. 



well with y3 = 315. 


‘The energy derivative of the phase shift can be related to the time delay suffered by a particle 
at resonance inside the potential well. See Merzbacher (1970), Section 6.8. 
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Resonance peaks in the transmission of particles are typical quantum features, 
and the classical picture is not capable of giving a simple account of such strong 
but smooth energy variations. Classically, depending on the available energy, T can 
only be zero or one, whereas in quantum mechanics T changes continuously between 
these limits. In the example of the potential barrier of Figure 6.4, although the 
transmission coefficient (6.15) is numerically small for E < V 0 , it is different from 
zero and varies continuously with energy. Classically, for such a barrier. T jumps 
from 0 to 1 at *E = V 0 . Thus, in a certain sense quantum mechanics attributes to 
matter more continuous and less abrupt characteristics than classical mechanics. 

While these observations have general validity, their verification by extending 
the solutions of the Schrodinger for discontinuous potentials to the classical limit 
meets with some obstacles. For example, the reflection coefficient (6.8) does not 
depend on h and is a function of the particle momentum only. Hence, it is apparently 
applicable to a particle moving under classical conditions. Yet classically, R is either 
0 or 1. This paradox is resolved if we recognize that the correct classical limit of 
quantum equations is obtained only if care is taken to keep the de Broglie wavelength 
short in comparison with the distance over which the fractional change of the po- 
tential is appreciable. The Schrodinger equation for the piecewise constant potential 
patently violates this requirement, but the next chapter will deal with potentials for 
which this condition is well satisfied. 

The transmission resonance theory outlined in this section cannot be expected 
to provide quantitative estimates for phenomena in the atomic and nuclear domain. 
Not only is the square well unrealistic as a representation of the forces, but also the 
limitation to one dimension is a gross distortion of the real systems. To appreciate 
the distinction, one only needs to be aware of the totally different energy spectra of 
the bound states in a square well in one and three dimensions. From formula (6.52), 
as well as from the analogous discussion in Section 4.4 for the particle in a box, we 
know that in one dimension the levels are spaced farther and farther apart with 
increasing energy; in three dimensions, however, the density of energy levels in a 
well increases rapidly with increasing energy. Resonances in three dimensions will 
be discussed in Chapter 13. 


Problems 

1. Obtain the transmission coefficient for a rectangular potential barrier of width 2 a if 
the energy exceeds the height V 0 of the barrier. Plot the transmission coefficient as a 
function of E/V 0 (up to E/V 0 — 3), choosing (2 ma 2 V 0 ) 1/2 = (3tt/ 2) h. 

2 . Consider a potential V = 0 for x > a, V = — V 0 for a s x >: 0, and V — + 00 for 
x < 0. Show that for x > a the positive energy solutions of the Schrodinger equation 
have the form 

gi(kx-\-28) g—ikx 

Calculate the scattering coefficient 1 1 — e ZiS \ 2 and show that it exhibits maxima (res- 
onances) at certain discrete energies if the potential is sufficiently deep and broad. 

3. A particle of mass m moves in the one-dimensional double well potential 

V(x) = ~gS(x — a) — g8(x + a) 

If g > 0, obtain transcendental equations for the bound-state energy eigenvalues of 
the system. Compute and plot the energy levels in units of h 2 /ma 2 as a function of 
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the dimensionless parameter mag/ti 2 . Explain the features of this plot. In the limit 
of large separation, 2a, between the wells, obtain a simple formula for the splitting 
A E between the ground state (even parity) energy level, E+, and the excited (odd 
parity) energy level, £_. 

4. Problem 3 provides a primitive model for a one-electron linear diatomic molecule 
with interatomic distance 2 a = |X|, if the potential energy of the “molecule” is 
taken as Zs+d-Xl), supplemented by a repulsive interaction Ag/|X| between the wells 
(“atoms”). Show that, for a sufficiently small value of A, the system (“molecule”) 
is stable if the particle (“electron”) is in the even parity state. Sketch the total po- 
tential energy of the system as a function of (X|. 

5. If the potential in Problem 3 has g < 0 (double barrier), calculate the transmission 
coefficient and show that it exhibits resonances. (Note the analogy between the system 
and the Fabry-Perot etalon in optics.) 

6. A particle moves in one dimension with energy E in the field of a potential defined 
as the sum of a Heaviside step function and a delta function: 

V(x) = V 0 7](x) + gd(x) (with Vo and g > 0) 

The particle is assumed to have energy E > V 0 . 

(a) Work out the matrix M, which relates the amplitudes of the incident and 
reflected plane waves on the left of the origin (x < 0) to the amplitudes on the right 
(x > 0). 

(b) Derive the elements of the matrix S, which relates incoming and outgoing 
amplitudes. 

(c) Show that the S matrix is unitary and that the elements of the S matrix satisfy 
the properties expected from the applicable symmetry considerations. 

(d) Calculate the transmission coefficients for particles incident from the right 
and for particles incident from the left, which have the same energy (but different 
velocities). 

7. For the potentials in Problems 5 and 6, verify the identity 

M 2 + \t\ 2 = Im t 

for the complex-valued amplitudes r and t, if the elements of the S matrix are ex- 
pressed as 5 n = 2 ir and S 2 i = 1 + 2 it. 



CHAPTER 7 


The WKB Approximation 


If the potential energy does not have a very simple form, the solution of 
the Schrodinger equation even in one dimension is usually a complicated 
mathematical problem that requires the use of approximation methods. 
Instead of starting with a simplified potential, as perturbation theory 
(Chapter 8) does, the WKB approximation assumes that the potential 
varies slowly as a function of x. The solution of the Schrodinger equation 
is represented as a modulated constant-potential wave function. The 
method is useful to advance our understanding of tunneling through a 
potential barrier, resonance behavior in the continuum, and exponential 
decay of an unstable system. 

1 . The Method. The WKB method for obtaining approximate solutions of the 

Schrodinger equation 


d 2 \jj 


2m 
h 2 




(7.1) 


is named after its proponents in quantum mechanics, G. Wentzel, H. A. Kramers, 
and L. Brillouin, but has its roots in the theory of ordinary differential equations. It 
can also be applied to three-dimensional problems, if the potential is spherically 
symmetric and a radial differential equation can be established. 

The basic idea is simple. If V — const, (7.1) has the solutions e ±lkx . This sug- 
gests that if V, while no longer constant, varies only slowly with x, we might try a 
solution of the form 




- iu(x) 


(7.2) 


except that the function u{x) now is not simply linear in the variable x. The same 
idea was already used in Sections 2.5 and 3.1 for the time-dependent wave equation 
in order to make the connection between quantum mechanics and Hamilton’s theory 
of classical mechanics. The function u(x ) in (7.2) is related to the function S(x, t ) 
in (3.2) as 


S( x, t ) = hu(x) — Et 


(7.3) 


which explains why the WKB method is occasionally referred to as a semiclassical 
version of quantum mechanics. 

Substitution of (7.2) into (7.1) gives us an equation for the x-dependent 
“phase,” u(x). This equation becomes particularly simple if we use the abbrevia- 
tions 

k(x) = \E ~ V(*)]j if E > V(x) (7.4) 


and 


K 2 


f 2m 

k(x) = ~i\~p [V(x) - E] 


1/2 


= — zk(jc) if E < V(x) 


(7.5) 




114 


Chapter 7 The WKB Approximation 


We find that u(x ) satisfies the equation 


s 

( duX 

dx 2 

\dx) 


12 


(7.6) 


This differential equation is entirely equivalent to (7.1), but the boundary conditions 
are more easily expressed in terms of than u(x). Although the Schrodinger 
equation is linear, (7.6), like the classical Hamilton-Xacobi equation, is a nonlinear 
equation. This would usually be regarded as a drawback, but in this chapter we will 
take advantage of the nonlinearity to develop a simple approximation method for 
solving (7.6). Indeed, an iteration procedure is suggested by the fact that u" is zero 
for the free particle. We are led to suspect that this second derivative remains rel- 
atively small if the potential does not vary too violently. When we omit this term 
from the equation entirely, we obtain the first crude approximation, w 0 , to u: 

K 2 = [k(x)f (7.7) 


or, integrating this, 



(7.8) 


If V is constant, (7.8) is an exact solution. If V varies with x, a successive 
approximation can be set up by casting (7.6) in the form 



, , „ s d 2 u 

"* V) + , £ 5 


(7.9) 


If we substitute the «-th approximation on the right-hand side of this equation, we 
obtain the (n + l)-th approximation by a mere quadrature: 


u n+l (x) = ± J \/k 2 {x) + i u'n{x) dx + C„+i (7.10) 

Thus, we have for n — 0, 


Mj(x) = ± \/k 2 (x) + i u'q(x) dx + Ci 

L (7-11) 

= ± \/k 2 (x) ± i k'(xj dx + Ci 


The two different signs in (7.8), (7.10), and (7.11) give approximations to two 
particular solutions of (7.6). If we denote these by u+ and m_, the general solution 
of (7.6) is expressible as 

u(x) = u+ — i ln[l + Ae ,(u -~ u+y ] + B (7.12) 

= u- - i In [A + e i( “ +_ “- ) ] + B 

where A and B are arbitrary constants, as befits a second-order ordinary differential 
equation. The corresponding solution of the Schrodinger equation is 

ijj(x) = e iuix) = e iB e iu+(x) + Ae iB e iu ~™ (7.13) 

which is a simple superposition of two particular approximate solutions of (7.1). 
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Our hope that the approximation procedure (7.10) will tend toward the correct 
u(x ) is baseless unless ufx) is close to u 0 (x), that is, unless 


k'(x) « |k 2 (x)| 


(7.14) 


In (7.11) both signs must be chosen the same as in the u 0 on which u x is supposed 
to be an improvement. If condition (7.14) holds, we may expand the integrand and 
obtain 1 


The constant of integration is of no significance, because it only affects the nor- 
malization of i/>(x), which, if needed at all, is best accomplished after the desired 
approximations have been made. 

The approximation (7.15) to (7.6) is known as WKB approximation. It leads to 
the approximate WKB wave function 


(*) 


■r 

-~r 


:k( x) + 


1 k'(x) 

2 ~k(x) 


dx + Ci 


(7.15) 


k(x) dx + - In k(x) + C, 


j/r(x) ** 


^fe exp 



k(x)dx 


(7.16) 


In a classically accessible region where E > V(x) and k(x) is real, the two waves 
(7.16) propagate independently in opposite directions. If the WKB approximation is 
valid, the potential changes so slowly with x that as the waves propagate no reflected 
(scattered) wave is generated. 

Condition (7.14) for the validity of the WKB approximation can be formulated 
in ways that are better suited to physical interpretation. If k(x) is regarded as the 
effective wave number, we may for E > V(x) define an effective wavelength 




k(x) 


The convergence criterion (7.14) can then be cast in the 


dk 

dx 


« 1 


(7.17) 


requiring the wavelength to vary only slowly. Condition (7.14) can also be written 
as 


A(x) 


dp 

dx 


« |p(x)| 


or 


A(x) 


dV 


dx 


[p(x)¥ 

m 


implying that the change of the “local momentum” p(x) = hk(x) over a wavelength 
must be small by comparison with the momentum itself, or that the potential energy 
change over a wavelength is much less than the local kinetic energy. 
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These conditions obviously break down if k(x ) vanishes or if k(x) varies very 
rapidly. This certainly happens at the classical turning points for which 

V(x) = E (7.18) 

or whenever F(x) has a very steep behavior. Generally, in these regions we expect 
that waves propagating in one direction will generate reflected (scattered) waves. A 
more accurate solution must be used in a region where (7.14) breaks down. The 
WKB method is not particularly useful unless we find ways to extend the wave 
function into and through these regions. 

In the nonclassical domain, where E < V(x), it is appropriate to rewrite the 
WKB wave function (7.16) in its more recognizably real form 

(7.19) 

The so-called connection formulas serve to link WKB solutions of the type (7.16) 
in the classically accessible region of x with solutions of type (7.19) in the classically 
inaccessible region. 



2. The Connection Formulas. Suppose that x = a is a classical turning point for 
the motion with the given energy, E, as shown in Figure 7.1a. The point x = a 
separates the two regions where E > V and E < V, when the classically impen- 
etrable barrier lies to the right of the classical turning point. Analogous consider- 
ations hold if the barrier is to the left of the turning point x = b (Figure 7.1b). The 
results for the two cases will be summarized in Eqs. (7.34) and (7.35). 

If the WKB approximation can be assumed to be applicable except in the im- 
mediate neighborhood of the turning point, we have 



for x » a (7.20) 


and 



for x « a (7.21) 


The lower limits of the integrals in the exponents have been arbitrarily, but conven- 
iently, chosen to make the meaning of the amplitudes A, B, C, and D unambiguous. 



{a) ( b ) 


Figure 7.1. (a) Classical turning point at x — a, to the right of the classically accessible 
region, (b) Classical turning point at x = b, to the left of the classically accessible region. 
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We now ask the fundamental question: How are the coefficients C and D related 
to A and B if (7.20) and (7.21) are to represent the same state, albeit in different 
regions? The inadequacy of the WKB approximation near the turning point is evi- 
dent, since k{x ) — * 0 implies an unphysical divergence of if/(x). To establish the 
connection between the two separated regions, we must solve the Schrodinger equa- 
tion more accurately than by the WKB approximation. This can always be done 
numerically, but an analytic approach works if a somewhat special, yet often appro- 
priate, assumption is made about the behavior of the potential energy near the turning 
point. 1 

We suppose that in the neighborhood of x — a, we may write 

V(x) ~ E « g(x - a) (7.22) 

where g > 0. The Schrodinger equation for this linear potential, 

h 2 d 2 ifj 

~—^ + g(x-a)^=0 (7.23) 

2m ax 


is conveniently transformed by substitution of the dimensionless variable z, 

= (2mgV' 3 




(jc — a) 


into the form 


d z ip 
dz 2 


- zt(f = 0 


(7.24) 


(7.25) 


Note that 



and that the WKB condition (7.14) implies 

| z [ 3/2 » 1/2 


(7.26) 


Thus, the WKB approximation is simply an asymptotic approximation for the so- 
lutions of Eq. (7.23), applicable where |z| becomes large. 

The solutions of the differential equation (7.25) are the Airy functions 2 Ai(z ) 
and Bi(z). Asymptotically, for large positive z, the leading terms are: 


Ai(z) = \ t T- y2 z - w e-z 


(7.27) 


and 


where 


Bi(z) = t 


(7.28) 



(7.29) 


'For a more general treatment of the WKB approximation at the turning points, and useful his- 
torical references, see Schiff (1968), Section 34. 

2 Abramowitz and Stegun (1964), Section 10.4. 
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(a) 

Figure 7.2. (a) The Airy function Ai(z), and (b) the Airy function Bi(z), and their 
asymptotic (WKB) approximations, for real-valued z. The approximations diverge at 
z = 0. 


For large negative values of z: 

AKZ) - TT- m \z I" 1 " COS (l ~ 



(7.30) 


Bi(z) = -tt l,2 \z\ 1/4 sin 


(7.31) 


Figure 7.2. (continued) 


Figures 1.2a and 1.2b are plots of the Airy functions Ai(z ) and Bi(z ) and their 
asymptotic (WKB) approximations. Except for z = 0, where the asymptotic forms 
diverge, the two functions agree very well. 

Since for the assumed linear potential (7.22), for z < 0, 
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and for z > 0, 

1/2 

f?) (*-«)“ = - 1*1“ = f (7.33) 

we verify that the asymptotic forms (7.27)-(7.31) are nothing but WKB wave func- 
tions of the type (7.20) and (7.21). By comparing the asymptotic expressions for the 
Airy functions with (7.20) and (7.21), we learn that the WKB wave functions on the 
two sides of the turning point x — a are connected as follows: 



The analogous connection formulas for a classical turning point x = b, which sep- 
arates a classically inaccessible region x < b from the accessible region x > b, are 



Exercise 7.1. By making the transformation x — a—> b — x which turns Figure 
7.1(a) into Figure 7.1(b), show that the connection formula (7.35) follows from 
(7.34). 

Exercise 7.2 . By comparing (7.34) with (7.20) and (7.21), derive the coeffi- 
cients C and D in terms of A and B. 

Caution must be exercised in employing these WKB connection formulas. To 
see this, suppose that we know that in the region x » a in Figure 7.1(a) the wave 
function is adequately represented just by the increasing exponential. Since in this 
region the decreasing exponential is naturally much smaller than the increasing ex- 
ponential, the contribution of the decreasing exponential to the wave function may 
be dwarfed even if the coefficient A is comparable in magnitude to B. Neglecting 
the decreasing exponential could then lead to a gross error in the WKB estimate in 
the classically allowed region x « a. 

Conversely, if B is finite but |B| « | A |, neglecting it entirely is justified in the 
region x « a, but as the coefficient of the increasing exponential component of the 
wave function, this “small” contribution may dominate the behavior of the wave 
function in the region x » a. 

The connection formulas, which link WKB wave functions between different 
regions of real and imaginary k(x), break down if two neighboring classical turning 
points are so close to each other that there is no WKB region between them. This 
happens, for instance, when the energy E is close to an extremum of the potential. 
It is then necessary to resort to more accurate analytical or numerical solutions of 
the Schrodinger equation. 
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Finally, we observe that the WKB approximation presumes that k(x) is an an- 
alytic function. This property fails not only at the classical turning points, but also 
at the singularities of the potential V(x). Care is required in continuing the WKB 
wave function through such singularities. 


Exercise 7.3. Show that the WKB approximation is consistent with the gen- 
eralized continuity equation (3.7), and thus with conservation of probability, even 
across classical turning points. 3 


3. Application to Bound States. The WKB approximation can be applied to de- 
rive an equation for the energies of bound states. The basic idea emerges if we 
choose a simple well-shaped potential with two classical turning points as shown in 
Figure 7.3. The WKB approximation will be used in regions 1, 2, and 3 away from 
the turning points, and the connection formulas will serve near x = a and x = b. 
The usual requirement that ifj must be finite dictates that the solutions which increase 
exponentially as one moves outward from the turning points must vanish. Thus, to 
satisfy the boundary condition as x the unnormalized WKB wave function 

in region 1 is 





for x < b 


Flence, by Eq. (7.35), with B — 0, in region 2, 

7r\ 




“ 4 cos (l 


k(x) dx 


This may also be written as 

2 


•A 


Vk(x) 

2 


cos k(x) dx — J 


for b < x < a 


7 T 

k(x) dx — - 
4 ) 


+ 


~\/k(x) 
2 


cos^J k(x) dx^j sin^ J k(x) dx — 


7 r ' 


\/k{x) 


sin^J k(x ) dx ^ cos ^J k(x) dx — — 



Figure 7.3. Simple one-dimensional potential well, Classically, a particle of energy E is 
confined to the region between a and b. 


3 This proposition is equivalent to the constancy of the Wronskian for the Schrodinger equation 
(7.1), which is of the Sturm-Liouville type. See Bradbury (1984), Chapter 7, Section 10. 
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By (7.34) only the second of these two terms gives rise to a decreasing exponential 
in region 3 satisfying the boundary conditions as x — » +°°. Hence, the first term 
must vanish. We obtain the condition 



(7.36) 


where n = 0, 1, 2, ... . This equation determines the possible discrete values of E. 
The energy E appears in the integrand as well as in the limits of integration, since 
the turning points a and b are determined such that V(a ) = V(b) = E. 

If we introduce the classical momentum p{x) — ±hk(x ) and plot p(x) versus x 
in phase space, the bounded motion in a potential well can be pictured by a closed 
curve (Figure 7.4). It is then evident that condition (7.36) may be written as 


J = 


p(x) dx = 



(7.37) 


This equation is very similar to the quantum condition (1.2) in the old quantum 
theory, which occupied a position intermediate between classical and quantum 
mechanics. 

The expression (7.37) equals the area enclosed by the curve representing the 
periodic motion in phase space and is called the phase integral J in classical ter- 
minology. If the WKB approximation is used all the way from b to a , (7.36) measures 
the phase change that the oscillatory wave function i p undergoes in region 2 across 
the well between the two turning points. Dividing this by 277, we see that according 
to the WKB approximation nl 2 + 1/4 quasi-wavelengths fit between b and a. Hence, 
n represents the number of nodes in the wave function, a fact that helps to visualize 
the elusive if/. 

In Figure 7.4, the area of enclosed phase space between the closed curves for 
n+1 and n is equal to h. As for free particles (Section 4.4), in the WKB approx- 


p 



Figure 7.4. Phase space representation of the periodic motion of a particle confined 
between the classical turning points at x = a„ and x = b n for the bound-state energies 
E n = V(a„) = V(b n ). The area between two neighboring trajectories is equal to h. 
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imation each quantum state may be said to occupy a volume h in two-dimensional 
phase space. In statistical mechanics, this rule is useful in the domain where classical 
mechanics is applicable but some concession must be made to the quantum structure 
of matter. 

The WKB method is fittingly called a semiclassical approximation, because for 
high energies ip has a very short wavelength in the classically accessible region. It 
is a rapidly oscillating f unct ion of position, but its maximum amplitude is modulated 
slowly by a factor \/Vk(x). The probability, \ip\ 2 dx, of finding the particle in an 
interval dx at x is proportional to the reciprocal of the classical velocity, Hv(x) 

[ E — V(x)\~ 112 . Classically, this is proportional to the length of time ( dt ) that the 
particle spends in the interval dx. It thus represents the relative probability of finding 
the particle in the interval dx if a random (in time) determination of its position is 
made as the particle shuttles back and forth between the turning points. We thus see 
that the probability concepts used in quantum and classical mechanics, though ba- 
sically different, are nevertheless related to each other in the limit in which the rapid 
phase fluctuations of quantum mechanics can be legitimately averaged to give the 
approximate classical behavior. 

As an illustrative example, we compare the exact solutions of the Schrodinger 
equation with approximate WKB energies and wave functions for the bound states 
of a particle of mass m in a potential well defined by V(x) = g \ x \ . The strength of 
the potential is measured by the positive constant g. Such a V-shaped potential is 
the one-dimensional analogue of the linear central-force potential, V(r) = Cr, to 
which the confinement of the quark-antiquark constituents of the charmonium 
“atom” is attributed. 

It is sufficient to solve the Schrodinger equation for the one-dimensional linear 
potential, 

E 2 d 2 ip . . 

+ = (738) 

for x > 0 and characterize the even and odd parity solutions by imposing the bound- 
ary conditions at the coordinate origin: 

<K(0) = 0 ^(0) = 0 (7.39) 


For x > 0, (7.38) is the same as Eq. (7.23), if we identify the energy E = ga. 
As in Section 7.2, the substitution 




a) 


(x > 0) 


(7.40) 


produces the differential equation for i p(z): 


d 2 4 / 
dz 2 


- z>p = 0 


(7.41) 


The boundary condition at large x requires that t p(z) must vanish asymptotically as 
z — > +°°. This condition implies that the energy eigenfunction must be the Airy 
function Ai{z). It is interesting to observe the universality of the differential equation 
(7.41), which is related to the original Schrodinger equation (7.38) by scaling and 
displacing the independent variable. There are no parameters in (7.41), and the graph 
of the solution in Figure 7.2 can be used for all values of the potential strength, g, 
and for all bound-state energies, E. 
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The unnormalized energy eigenfunctions are ip(z) = Ai(z), or in terms of the 
original x coordinate (x > 0): 


<A» O) = Ai 


/ 2mg\ 

) 


1/3 


X — 


8 


(7.42) 


The conditions (7.39) at the origin now require that for the even eigenfunctions 


<K(0) - AT 

and for the odd eigenfunctions 

t//„(0) = Ai 


/ 2 mg^ 

I 1 ' 3 En' 

[ U 2 y 

' 8 . 


1 2 mg^ 

En_' 

L U 2 y 

8 . 


= 0 


= 0 


(7.43) 


(7.44) 


The (negative) zeros of Ai(z) and Ai'(z) may be computed or read off Figure 7.2(a). 4 
The five lowest energy levels are listed in the second column in Table 7.1. 

In the WKB approximation, the approximate energies are extracted from the 
condition (7.36), which in the present context translates into 

£/ s fhn / l\ 

hr (E - gx) dx = I n + - Jtt 


t 


From this condition we obtain the WKB estimate: 


3 9 J 1 Y g 2 h 2 

(£ ™> - 32 A n + V TT 


(7.45) 


The values of E WKB for n = 0 to 4 are entered in the third column of Table 7.1. 
Except for the ground state energy, the agreement with the ‘ ‘exact’ ’ numerical values 
is seen to be excellent. 


Table 7.1 


n 

E n in units of (g^^/m) 1 ' 3 

Ewkb in units of (g 2 fi- 2 /m) 1 ' 3 

0 

0.8086 (ground state) 

0.8853 

1 

1.8558 

1.8416 

2 

2.5781 

2.5888 

3 

3.2446 

3.2397 

4 

3.8257 

3.8306 


E. 


ft 2 


1/3 


If the coordinate x and the length — , are scaled in units of I — , and are thus 


8 




made dimensionless, the unnormalized energy eigenfunctions functions (7.42) are, 
for x > 0, expressed as 




= Ai 



4 For numerical tables, see Abramowitz and Stegun (1964), Table 10.11. 
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These functions can be evaluated from Figure 1.2(a), if the substitution 

z 

V(x) = g|x] is shown in Figure 8.1.) 


uese luneuuns ean uc evaruaieu iruiii riguic / n uic &uut>uuuiun 

= *?/ 2[x - — ) is made. (The ground state eigenfunction for the potential 

V 8 ) 


Exercise 7.4. For the potential V(.x) = g|;c|, compute (or use Figure 1.2(a) to 
obtain) the energy eigenfunctions for n — 0 to 4 and plot them. For n — 3 and 4 
compare the exact eigenfunctions with the WKB wave functions. For these two 
energy levels, sketch the exact quantum mechanical, the WKB, and the classical 
probability densities of finding the particle at position x. Explain why the WKB 
wave functions have a discontinuous slope at the origin for even values of n. 


Exercise 7.5. In a crude model, the 5 states of the charmonium “atom” are 
regarded as the energy levels of a charmed quark and antiquark, bound by a one- 
dimensional potential that (except near the origin) is represented by V(;c) = g|x|. 
Noting that the two lowest S states have measured rest energies of 3.1 and 3.7 GeV, 
respectively, and ignoring relativistic effects on the binding, obtain an estimate for 
the potential strength parameter g in units of GeV/fm. (Treat this two-body problem 
as an effective one-body problem with a reduced quark mass, i.e., half the rest mass 
of the charmed quark. It is useful to remember that he ~ 0.2 GeV • fm.) 

Exercise 7.6. Show that the WKB approximation gives the energy levels of 
the linear harmonic oscillator correctly. Compute and plot the WKB approximation 
to the eigenfunctions for n = 0 and 1, and compare with the exact stationary state 
wave functions. 


The WKB method, with its connection formulas, is relatively straightforward 
for systems that are described by or reducible to a one-dimensional Schrodinger 
equation. The Hamiltonian of a multidimensional system need not be particularly 
exotic for the trajectories in phase space to display a far more complicated character 
than illustrated in Figure 7.4. When the classical system is integrable and its motion 
multiply periodic, it is possible to generalize the WKB method if due attention is 
paid to the singularities that are the analogues of classical turning points, but in 
practice one deals frequently with systems exhibiting classically chaotic motion. 
Semiclassical quantum mechanics for complex systems has benefited greatly from 
recent advances in (nonlinear) classical dynamics and constitutes a subject beyond 
the scope of this book. 5 

4 . Transmission Through a Barrier. The WKB method will now be applied to 
calculate the transmission coefficient for a barrier on which particles are incident 
from the left with insufficient energy to pass to the other side classically. This prob- 
lem is very similar to that of the rectangular potential barrier, Section 6.2, but no 
special assumption will be made here concerning the shape of the barrier. 


5 Gutz wilier (1990). 
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E 


Figure 7.5. Potential barrier. 


If the WKB approximation is assumed to hold in the three regions indicated in 
Figure 7.5, the solution of the Schrodinger equation may be written as 

k (x < a) 

dx^ (a < x < b) (7.46) 

k cb^J (b < x) 

The connection formulas (7.34) and (7.35) can now be used to establish linear re- 
lations between the coefficients in (7.46) in much the same way as was done in 
Chapter 6 for the rectangular barrier. The result of the calculation is remarkably 
simple and again is best expressed in terms of a matrix M that connects F and G 
with A and B. 


(7.47) 


where the parameter 

(7.48) 

measures the height and thickness of the barrier as a function of energy. 

Exercise 7.7. Verify (7.47). 

The transmission coefficient is defined as 

J, _ 1 ’fttransh V tram _ I ’AtransV^trans I _ \F\ 2 

I'/'inc ! 2 l>inc \^incVk~ c \ Z l A P 
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assuming that there is no wave incident from the right, G — 0. From (7.47) we 
obtain 



(7.49) 


For a high and broad barrier, 6 » 1, and 



(7.50) 


Hence, 6 is a measure of the opacity of the barrier. 

As an example, we calculate 6 for a one-dimensional model of a repulsive 
Coulomb barrier (Figure 7.6), which a charged particle such as a proton (charge Z r e) 
has to penetrate to reach a nucleus (charge Z 2 e). The essence of the calculation 
survives the generalization to three dimensions (Section 13.8). Thus, let Fbe defined 
for x < 0 as 


v= ^ e 2 

JC 

The turning point a is determined by 

E — 

a 


(7.51) 


V 



Figure 7.6. One-dimensional analogue of a Coulomb barrier, which repels particles 
incident from the left. 
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i 


and we take b = 0 , callously disregarding the warning signals that caution us about 
applying the WKB approximation near a singularity of the potential energy. The 
critical integral is then 



V2 mE (° 
h ' Ja 




Z^e 2 

hv 


7 T 


where v = V2 Elm is the classical particle velocity at x — » — . Hence, 




27r Z x Z 2 e 2 \ 
hv ) 


(7.52) 


The barrier inhibits the approach of a positive charged particle to the nucleus, and 
the transmission coefficient is called the penetrability. This quantity determines the 
probability of nuclear fusion, and it is also decisive in the description of nuclear 
alpha decay, since the alpha particle, once it is formed inside the nucleus, cannot 
escape unless it penetrates the surrounding Coulomb barrier. 

Exercise 7.8. Calculate the transmission coefficient for the model Coulomb 
potential (7.51) by assuming, more realistically, that b, instead of being zero is equal 
to a fraction of a, i.e., b - ea. Apply the result to the calculation of the Coulomb 
barrier penetrability for an alpha particle (Z, = 2) with asymptotic kinetic energy E 
in the repulsive field of a nucleus (Z 2 ), with nuclear radius b — R. Express E in MeV 
and R in Fermis. 

As a further application of the WKB method, let us consider the passage of a 
particle through a potential well that is bounded by barriers as shown in Figure 7.7. 
It will be assumed that V(x) is symmetric about the origin, which is located in the 
center of the well, and that V = 0 outside the interval between — c and c. 



Figure 1.1. Potential barriers surrounding a well are favorable for the occurrence of 
narrow transmission resonances. Regions 1 through 7 are defined as: (1) x < — c; 

(2) —c < x < — b; (3) ~b < x < -a; (4) -a < x < a; (5) a < x < b; (6) b < x < c; 
(7) c < x. A wave packet is seen to be incident from the left. 
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In this section, the effect of barrier penetration will 
with an energy E below the peak of the barriers. We are 
the form of the energy eigenfunctions in regions 1 and 7 : 

r 


<i !>(x) = < 


exp(ifcc) + exp(— ikx) 

A 7 B 7 

■^y= expO'Ax) + -^= exp (—ikx) 


be studied for a particle 
particularly interested in 


(x < — c) 
(x > c) 


(7.53) 


When the WKB method is applied to connect the wave function in regions 1 and 7, 
the relation between the coefficients is again most advantageously recorded in matrix 
notation: 




M l 

Bx 


2 ip 


A& + 


AQ : 


cos L — 2 i sin L 


1 


i[ Ad 2 - 


A6 : 


-il Ad- 1 ~ ^ ) cos L 

4<r 


p 2i P 


cos L 


Ad 2 + 1 cos L + 2 i sin L 

AO 2 


Ai 

b 7 


In writing these equations, the following abbreviations have been used: 4 


-L 


k(x) dx, 


-f 


^(x) dx — kc 


It follows from the definition of L and from inspection of Figure 7.7 that 

dL n 
— > 0 
dE 


(7.54) 


(7.55) 


(7.56) 


We will shortly make use of this property. 

The final matrix relation (7.54) has the form (6.39) subject to the condition 
(6.22). This result is expected since, as was pointed out in Section 6.3, the matrix 
that links the asymptotic parts of the Schrodinger eigenfunction has the same general 
form for all potentials that are symmetric about the origin. 

From Eq. (7.54) we obtain, for B 7 — 0, 

A 1 _ fi 2i P 

- 7 - = — = y/fe~** = — (7.57) 

Ai Mn I / , 1 \ 

- 4 6r + cos L — i'sm L 
2 V 4 d 2 J 


According to (6.40), the transmission coefficient is 


1 

l^nl 2 



4 


cos 2 L + 4sin 2 L 


(7.58) 


This quantity reaches its maximum value, unity, whenever cos L = 0, or 

L = (2n + 1) 7t/2 (7.59) 

The condition determining the location of the transmission peaks is seen to be the 
same as the quantum condition (7.36) for bound states. If 6 » 1, so that penetration 


6 Our notation is adapted from a thorough discussion of barrier penetration in Bohm (1951). 
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through the barriers is strongly inhibited, T has sharp, narrow resonance peaks at 
these energies. A graph of T in the resonance region will be similar to Figure 6.8. 

Under conditions favorable for the occurrence of pronounced resonances 
(0 » 1), it may usually be assumed that in the vicinity of the resonances in a 
reasonable approximation, 

cos L ~ + ( — ) (E- E 0 ), sin L ~ ±1 

Substituting these approximations in (7.57) and evaluating the slowly varying quan- 
tity 0 at E = E 0 , we get 


\Zfe~ i4> = + 


172 


E - E 0 + iT/2 


a 2ip 


(7.60) 


where by definition 


0 2 (9L/0£) £=Eo 


(7.61) 


Exercise 7.9. Show that the energy spacing D between neighboring resonances 
is approximately 


D = 


7 T 

dL/BE 


(7.62) 


and that for low barrier penetration (0 » 1), D » r, so that the resonances are 
well separated. 


Exercise 7.10. Apply the resonance approximation to the transmission coef- 
ficient T, and show that near E 0 it has the characteristic Breit-Wigner resonance 
shape, T being its width at half maximum. Compare with Figure 6.8. 


Exercise 7.11. A nucleon of energy E is incident on a one-dimensional toy 
model of a “nucleus” of radius R = 4 fm (diameter 2 R). The attractive potential 
inside the nucleus has a mean depth of V 0 = 65 MeV. Approximately rectangular 
barriers of average 8 fm width and 5 MeV height bound the nucleus symmetrically, 
so that the potential looks qualitatively like Figure 7.7. Estimate the value of the 
barrier opacity d 2 and of L as a function of E. Calculate the energy and width of the 
lowest nucleon transmission resonance below the barrier. Are there other resonances 
nearby? 

5. Motion of a Wave Packet and Exponential Decay. It is instructive to consider 
the motion of a si mple broad wave packet incident with well defined positive mo- 
mentum hk 0 = V2 mE 0 from the left (from region 1 where x < ~c) onto the well 
protected by a barrier, as shown in Figure 7.7. A wave packet, which at t = 0 is 
localized entirely in region 1 near the coordinate x 0 « 0 far to the left of the barrier 
and moving toward positive x, may be represented by the wave function in k-space: 

4>{k) = \4>(k)\e~'^ (7.62) 
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Here | (f>(k) \ is a smoothly varying function with a fairly sharp peak and a width A k, 
as exemplified in Figure 2.1, which vanishes for k < 0. Though, in the presence of 
the potential VQc), k is n ot co nstant, asymptotically (|jc| — » °°) we can use the free- 
particle relation hk — v2mE to express ift(x, 0) as an integral over E instead of k. 
Since the wave packet is narrow in k space, the two variables are related approxi- 
mately as 


k = k 0 *+ (k — k 0 ) = k 0 + 


k 2 — kl 
k + k 0 


“ k 0 + 


k 2 — kf, _ k 0 
2 k 0 ~ 2 


E 

hv 0 


The initial wave packet can then be written as 


<A(x, 0) = -L [ \ ( j>(k)\ e iKx - Xo) dk 

V2i t Jo 

1 roo /'GO 

= — j= \(f>(k)\e ikix - Xo) dE = f(E)e iKx ~ Xo) dE 

V2tt fiv 0 Jo IV ^ V Jo 


(7.63) 


The amplitude f(E), defined in (7.63), is a smoothly varying positive function of E 
with a fairly sharp peak and a width A E ~ v 0 EA k. If if/(x, 0) is normalized to unity, 
f{E) satisfies the normalization condition 



(7.64) 


The representation (7.63) expresses i(/(x, 0) as a superposition of infinite plane 
waves, but we need an expansion of if(x, 0) in terms of the appropriate WKB wave 
functions, whose asymptotic form is given by (7.53). Since no wave is incident from 
the right, we include only WKB wave functions with B 1 — 0. In the asymptotic 
region 1, the expansion has the form 

<K*, 0) = J q f(E)e~ ikx °(^ e ikx + ^ 

where the k-dependent coefficients A x and B x are the same as those that appear in 
(7.53). Equation (7.65) holds because the integral 


— ikx 


\dE 


(x < — c) 


(7.65) 



e -ik(x+xo) dE 


(7.66) 


differs from zero only when the phase in the integrand is stationary, i.e., for values 
of x near x = —x 0 . Elsewhere it vanishes owing to the rapid oscillations of the 
exponential function. Hence, it vanishes in region 1. 

In the asymptotic region 7 to the right of the barrier, the transmitted wave 
function at arbitrary times t > 0 is: 

tp(x, t) = f(E)Vf exp (—*'<£) exp 

In order to study the behavior of this transmitted wave packet near a very narrow 
resonance we assume that the mean energy E 0 of the incident wave packet corre- 
sponds to a resonance. We also assume that the width A E of the packet considerably 
exceeds the width of the resonance (but is much smaller than the interval between 
neighboring resonances). We therefore are entitled to substitute (7.60) into the in- 
tegrand of (7.67). 


ik(x — x 0 ) 


r Et 

i 


dE 


(x > c) (7.67) 
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Except for uncommonly long-range potential barriers, the phase p may be as- 
sumed constant, and equal to p 0 , over the width of the resonance. With these ap- 
proximations, the wave function in region 7 at positive t becomes 


t ) ~ -hf(E 0 )exp 


k 0 

i-~(x - x 0 ) + 2 ip 0 




' x — x 0 
V v 0 
e 0 + ir/2 


dE 

(7.68) 


In (7.68), the integration has been extended to — 00 without appreciable error, as- 
suming that t is not too large. (For extremely long times t —> °°, the results to be 
derived may therefore be suspect.) 

The integral in (7.68) is a well-known Fourier integral that is evaluated in the 
Appendix, Eq. (A. 22). The result is that in the asymptotic region x > c. 


<K*, 0 


± niT f(E 0 )e 2ip ° exp 
0 


r ( x — x 0 


2 h V v 0 


- t 


exp[ik 0 (x - x 0 )]e iE ° M if t > 

if t < 


x — x 0 
Vo 

X — Xp 

Vo 

(7.69) 


This wave function describes a wave packet with a discontinuous front edge at 
x = x 0 + v 0 t and an exponentially decreasing tail to the left. After the pulse arrives 
at a point jc the probability density decays according to the formula 


if/(x, t) | 2 


7r 2 r 2 |/(£o)| 2 exp 


r 

h 



(7.70) 


Figure 7.8 shows the distorted wave packet at various times after it has reached 
region 7. We may calculate the total probability that at time t the particle has been 


Upix, t )\ 2 



Figure 7.8. The probability density j i (f(x,t) \ 2 of a decaying state for three different times, 
t\ < t 2 < t 3 . At a fixed position x, after the arrival of the wave front, the probability 
density decays exponentially in time. At a fixed time, the probability density increases 
exponentially as a function of x (up to the front edge of the wave front). 
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transmitted and is found anywhere in region 7. For a wave packet whose energy 
spread A E covers a single resonance such that 

D » A E » T (7.71) 


this probability is 

rx 0 +v 0 t 


rx 0 +v 0 t i 

J o t)\ 2 dx = 7r 2 hTv 0 \f(E 0 )\ 2 \ 1 - exp 




Here we have assumed that c « x 0 + v 0 t, so that the lower limit of the integral 
can be set equal to zero. From the normalization (7.64) we obtain as a crude estimate 


27rhv 0 \f(E 0 )\ 2 


1 

A E 


Hence, an order of magnitude estimate for the probability that transmission has 
occurred is 



The total transmission probability for the incident wave packet (7.63), found by 
letting t —> oo, is thus approximately equal to I7A£. Equation (7.72) leads to the 
following simple interpretation: The wave packet reaches the well at time —x 0 /v 0 . 
A fraction T/AE of the packet is transmitted according to an exponential time law 
with a mean lifetime 


T = 




(7.73) 


The remaining portion of the wave packet is reflected promptly. 

The study of resonance transmission affords us an example of the familiar ex- 
ponential decay law, and the well with corresponding barriers can serve as a one- 
dimensional model of nuclear alpha decay. Decay processes will be encountered 
again in Chapters 13 and 19, but it is well to point out here that the exponential 
decay law can be derived only as an approximate, and not a rigorous, result of 
quantum mechanics. It holds only if the decay process is essentially independent of 
the manner in which the decaying state was formed and of the particular details 
of the incident wave packet. 


Exercise 7.12. Show that condition (7.71) implies that the time it takes the 
incident wave packet to enter the well must be long compared with the classical 
period of motion and short compared with the lifetime of the decaying state. 

Exercise 7.13. Resonances in the double well (Figure 7.7) may also be defined 
as quasi-bound states by requiring A l = B 7 — 0 (no incident wave) or M n — 0 in 
analogy to truly bound states [see Eq. (6.51)]. Show that this condition defines poles 
in the S matrix as a function of the complex variable E. The real parts of these 
discrete complex values of E are the resonance energies, whereas the imaginary parts 
are the half- widths. Construct and interpret the asymptotic solution of a wave equa- 
tion (time-dependent Schrodinger equation) which corresponds to one of these com- 
plex E values ( decaying states ). 
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Problems 


1. Apply the WKB method to a particle that falls with acceleration g in a uniform 
gravitational field directed along the z axis and that is reflected from a perfectly elastic 
plane surface at z = 0. Compare with the rigorous solutions of this problem. 

2. Apply the WKB approximation to the energy levels below the top of the barrier in a 
symmetric double well, and show that the energy eigenvalues are determined by a 
condition of the form 



= ±20 


where 9 is the quantity defined in (7.48) for the barrier, a is a constant dependent on 
the boundary conditions, and the integral J k dx is to be extended between the clas- 
sical turning points in one of the separate wells. Show that at low transmission the 
energy levels appear in close pairs with a level splitting approximately equal to ha)hrd 
where oj is the classical frequency of oscillation in one of the single wells. 

3. A particle of mass m moves in one dimension between two infinitely high potential 
walls at x = a and x = —a. In this interval the potential energy is V = —C\x\, 
C being a positive constant. In the WKB approximation, obtain an equation deter- 
mining the energy eigenvalues E < 0. Estimate the minimum value of C required for 
an energy level with £s Oto exist. 

4. Apply the WKB approximation to a particle of mass m moving with energy E in the 
field of an inverted oscillator potential, V(x) — -mwV/2. Determine the WKB wave 
functions for positive and negative values of the energy, E. Estimate the limits of the 
region in which the WKB wave functions are expected to be valid approximations to 
the exact Schrodinger wave function. 



CHAPTER 8 


Variational Methods and Simple 
Perturbation Theory 


4 

Variational methods are as central to quantum mechanics as to classical 
mechanics. They also serve as a springboard for numerical computation. 
When quantum mechanics is applied to realistic physical systems, we 
must usually employ approximation methods. The WKB method (Chapter 
7) is limited to models that tend to be oversimplified. Simple perturbation 
theory, introduced in this chapter, greatly extends our access to interesting 
applications. A more systematic study of perturbation methods is the 
subject of later chapters (18 and 19). The Rayleigh-Ritz variational 
method provides a bridge from wave mechanics to matrix mechanics. The 
molecular approximation capitalizes on the mass disparity between 
electrons and nuclei. This chapter concludes with examples of 
applications to molecular structure and the band theory of solids. 


1. The Calculus of Variations in Quantum Mechanics. Explicit analytic solu- 
tions of the Schrodinger equation can be derived only for a limited number of po- 
tential energy functions. The last three chapters dealt with three of these: the har- 
monic oscillator, the piecewise constant potential, and the linear potential. Another, 
the Coulomb potential, will be studied in Chapter 12. In the meantime, we will 
consider some very general variational methods, which make it possible to obtain 
approximate energy eigenvalues and which also shed light on the physical properties 
of the stationary states. 

The time-independent Schrodinger equation can be regarded as the Euler- 
Lagrange equation for a variational problem that can be formulated as follows: 

Subject to the constraint 




iff*(r)4>(r)d 2 r = 1 


( 8 . 1 ) 


find the functions and ip*(r) that cause the variation 81 of the expression 



( 8 . 2 ) 


to vanish. 

More graphically, but somewhat less precisely: Find the functions fi(r) and 
which make the integral (8.2) stationary in the mathematical sense of the 
term. If ij/( r) and ijj*( r) are sufficiently smooth and vanish properly at the boundary, 
the expression (8.2) can be transformed by integration by parts into 



2m 


ift*V 2 ip 4 - 



(8.3) 
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which is the formula for the expectation value of the energy, so that I = { H ). For 
the purpose of establishing the variational principle, Eq. (8.2) is more appropriate. 
For many applications (8.2) and (8.3) are equivalent, but (8.2) has precedence when- 
ever there are ambiguities. 

For an understanding of the variational principle and its implications, it is suf- 
ficient to work with the one-dimensional case. According to the method of Lagran- 
gian multipliers, the constrained problem posed here is equivalent to an unconstrai- 
ned variational problem for the integral I — Efij/*if/ dx: 

8 f F(x) dx = 8 f (^— + Vif/*if/ - Eif/*if\dx = 0 (8.4) 

J J \2m dx dx J 

where E is an as yet undetermined constant, the Lagrangian multiplier. If if/(x) and 
if/*(x) are varied independently of each other, as if they were two unrelated functions 
with their conjugate relationship temporarily ignored, two differential (Euler) equa- 
tions are obtained: 


OF 

dif/* 

dF 

dif/ 


d dF _ h 2 d 2 if/ 
dx dif/*' 2m dx 2 
d dF h 2 d 2 if/* 

dx dif/' 2m dx 2 


+ Vif/ - Eiff = 0 
+ Vif/* - Eif/* = 0 


(8.5) 


where if/' and if/*' denote derivatives with respect to x. Both of these equations have 
the form of the time-independent Schrodinger equation. Since if/(x) and if/*(x) satisfy 
the appropriate regular boundary conditions, E is assured to be a real number. Hence, 
all equations are consistent if if/*(x) is chosen as the complex conjugate of i f/(x). 
[The procedure of varying if/(x) and ip*(x) independently is equivalent to expressing 
if/ in terms of Rei f/ + i Imi f/ and varying Rei f/ and Imip independently.] 

The simultaneous use of if/(r) and if/*(r) may seem like an unnecessary com- 
plication if V is real. It leads, however, to a variational expression with a simple 
physical significance, since (8.3) denotes the expectation value of the energy. As 
the energy, this quantity has a lower bound, and the energy of the ground state of 
the system can therefore be determined as the absolute minimum of ( H ). The vari- 
ational procedure is easily generalized to a particle in three dimensions and to the 
Schrodinger equation for a particle in a magnetic field, which is not invariant under 
time reversal and has no simple reality properties. 

The equivalence of the variational principle and the Schrodinger equation can 
also be demonstrated without constraining if/( r) and if/*( r) as in (8.1) and using a 
Lagrangian multiplier, but instead by expressing the expectation value of the Ham- 
iltonian as 


/ I-t\ ^ 

if/*Hif/ d 3 r 

\H) - 

j i f/*ip d 3 r 


( 8 . 6 ) 


If the variation of (H) is defined by 

J (if/* + 8if/*)H(if/ + 8if/)d 2 r J if/*Hif/ d 3 


8(H) - 


J (if/* + 8if/*)(if/ + 8if/)d 3 r j if/*if/ d 3 


(8.7) 
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for trial functions ip + Sip and ip* + 8if/*, upon expanding this expression we obtain: 

^ J i p*ip d 3 r^j 8(H) = J ip*ip d 3 r J 8ip*Hip d 3 r + J ip*H8ip d 3 r j 

— f i p*Hip d 3 r f 8ip*ip d 3 r + f i p*8ip d 3 r + 0[(8ip) 2 ] 

J \- J J -I (8.8) 


where 0[(8tp) 2 ] symbolizes all terms of order higher than the first in the generally 
independent variations Sip and Sip*. Two conclusions can be drawn from (8.8): 

1 . If i p = ip k is one of the normalized eigenfunctions of H, such that 



and 



d 3 r = E k 


then 8(H) = 0 to first order, and (H) is made stationary by the eigenfunction. 

2. Conversely, if all first-order variations 8(H) are stipulated to vanish, then ip 
must be an eigenfunction of H. The proof of this proposition follows if we use the 
Hermitian property of H and choose as the variation of ip: 


Sip = 


(If 


i p*ip d 3 r )Hip 


-(/ 


ip*Hip d 3 r 1 1 p 


with a small real number e. The condition 8(H) = 0 then gives, from the first-order 
terms in (8.8), the equation 

Hip = Eip (8.9) 

where 

E = (H) (8.10) 

Condition (8.9) is the equation whose eigenfunctions and eigenvalues are ip k and E k . 

To summarize: If a trial function ip k + Sip, differing from an eigenfunction ip k 
by 8tp, is used to calculate the expectation value of the Hamiltonian, E k + 8(H) is 
obtained, and for small 8ip the change 8(H) is of order (Sip) 2 . 

In practice, then, we can obtain various overestimates of the ground state energy 
by calculating (H) for suitably chosen trial functions. Obviously, altogether erro- 
neous estimates are likely to be obtained unless the trial function simulates the 
correct ground state wave function reasonably well. Usually, it is necessary to strike 
a compromise between the desire to improve the estimate of the ground state energy 
by choosing a “good” wave function and the requirement of ease of calculation. 

The variational method is frequently applied to the Schrodinger equation by 
using for the calculation of (H) a trial wave function that contains one or more real 
variable parameters a, \ 3, y ... If the expectation value (H) is a differentiable func- 
tion of these parameters, the extrema of (H) are found with the help of the equations 

m = m = m = ... = 0 (8 . H) 

da d/3 dy 

Evidently, the absolute minimum of (H) obtained by this method gives an upper 
bound for the lowest eigenvalue of H. This bound may even be a fair approximation 
to the lowest energy eigenvalue if the parametric trial function is flexible enough to 
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simulate the ground-state eigenfunction tj/ 0 . The other, relative, extrema of (H) may 
correspond to excited states of the system, provided that the trial function is suffi- 
ciently adaptable to represent the desired eigenstate of H to a reasonable approxi- 
mation. 

As an example, we consider the Schrodinger equation for the potential 
V(x) = g\x\. This problem was solved in Section 7.3 by exact methods and by 
applying the WKB approximation. Here we make use of the variational method by 
choosing a normalized Gaussian trial function < with adjustable width, 

/ \ 1/4 

«*) = (8.12) 


The expectation value of H becomes 


r+<*> f *2 d 2 ) h 2 OL 

(H) = ^--— + g \ x \^ m dx = — 

From this expression we obtain the condition for a minimum: 

d{H) h 2 


+ 


8 


V2 


rra 


(8.13) 


da 


8_ 

2m 2V27T 


- 3/2 


or 


_ ( 8 m \ 
a VV2rfJ 


\ 2/3 


When this result is substituted in (8.13), the variational estimate 


(H) = 


1 


7 T 


1/3 


1 1 

2 4/3 + 2 1 « 


' 2*2 
8 » 


1/3 




= 0.813 


m 


m 


1/3 


(8.14) 


(8.15) 


is obtained. As expected, this estimate exceeds the exact ground state energy 

(g 2 f > 2 \ V3 * 

E 0 - 0.809 5 * 


m 


but is remarkably close to this value. Figure 8.1 shows a comparison of the exact 
(Airy) ground state wave function and the optimal Gaussian trial function. 


Exercise 8.1. For the Schrodinger equation with a potential V(x) = g|x| (with 
g > 0), use an optimized exponential trial function to estimate the ground state 
energy. How does the result compare with the estimate from the Gaussian trial 
function? 


Exercise 8.2. Calculate the variational estimate for the ground state energy of 
the potential V{x) = g)x| (with g > 0), using the triangular trial function 

i f/(x) = C(a — |x|) for |jc| ^ a, and i p{x) — 0 for |jc| > a 

Why is it safest to use (8.2) rather than (8.3) here? How good is the estimate? 

Exercise 8.3. Invent other simple trial functions that can be used to estimate 
the ground state energy of the particle in the potential V(x) = g|jc| (with g > 0). By 
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Figure 8.1. The thick line is the normalized ground state energy eigenfunction, 

<ft 0 (x) = 1.47 Ai["^/ 2(x — 0.8086)], for the potential V(x) — |jc|, and the thin line represents 
the optimal normalized Gaussian variational trial function, <M*)- The x coordinate and the 
energy are made dimensionless by scaling, as described in Section 7.3. The mean-square 
deviation between the two functions is only /T “ j i/^ 0 (jc) — dx ~ 10“ 3 ! 


using odd trial functions with only one node, at the origin, obtain numerical esti- 
mates for the first excited state and compare with the exact value of E 1 in Table 7.1. 
Can you proceed to the second excited state and calculate a variational energy es- 
timate for this state? 


2. The Rayleigh-Ritz Trial Function. One of the most prominent applications of 
the variational principle is the Rayleigh-Ritz method. This procedure is applicable 
to the discrete spectrum of an Hermitian operator and consists of using as a trial 
function if/ a linear combination of n suitably chosen, linearly independent, quad- 
ratically integrable functions XT 


n 


<Kr) = E c i Xi( r ) 
1=1 


(8.16) 


The functions Xi are referred to as a set of basis functions. The coefficients c, are to 
be determined from the variational principle. Their real and imaginary parts may be 
taken as the variational parameters, or, as in Section 8.1, the coefficients c t and c* 
may be varied independently. The can be regarded as vectors in an n-dimen- 
sional space that is spanned by the basis ^ (r). The coefficients c, are the components 
of the vector. 

Generally, the basis functions Xi need not be orthogonal or normalized to unity. 
If they are not, we know from Section 4. 1 how to proceed in order to replace them 
by an equivalent orthonormal set of n basis functions. This cumbersome orthogon- 
alization procedure can be avoided by keeping the formalism sufficiently general to 
accommodate the use of nonorthonormal basis functions, much as nonorthogonal 
coordinate systems may be employed in geometry. The Rayleigh-Ritz method will 
be extended to arbitrary basis functions in Section 8.4. In this section, we assume 
that the basis functions are orthonormal. 
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The expectation value (H) is expressed in terms of the c, and c* as 

f iff*Hif/d 3 r 2 c*j{j\H\i) Ci 

J i,j= 1 


(H) = 


J i d 3 r 

where the matrix elements of H are defined by 


2 C*Ci 

ij= i 


01^1 0 - J d 3 


Since H is a Hermitian operator, 

— 

(i\H\j) = J iftlHi/jj d 3 r = j d 3 

= d 3 r^- = (j\H\i)* 

The variational conditions for making ( H ) stationary, 

d(H) d(H) . . 

= 0 and — — = 0 for all i,j— 1, 2, ... n 


dc * 


dc t 


produce the n linear homogeneous equations 


(8.17) 


(8.18) 


(8.19) 


( 8 . 20 ) 


2 (i\H\j)cj = (H)c t 
j= i 


( 8 . 21 ) 


Exercise 8.4. Derive (8.21) from the variational principle, 8(H) = 0. Show 
that, because H is Hermitian, the second set of linear equations derived from (8.20) 
is redundant. 


If we denote the real expectation value of H corresponding to the (initially 
unknown) optimal trial function by (H) — E, the system of equations (8.21) can be 
written as 


n 


2 d\H\j)Cj 


= E c t 


( 8 . 22 ) 


or 


2 (d\H\j) - E8y) Cj = 0 (8.23) 

j= i 

This system of equations for the coefficients c, has nontrivial solutions only if the 
characteristic value (or eigenvalue) E is one of the n roots of the determinantal 
(characteristic or secular ) equation: 

D„ (E) - det«i|tf I;) - E8.J) = 0 


(8.24) 
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If we take the complex conjugate of Eq. (8.22) and use the property (8.19), we obtain 

n 

2 c*(j\H\k) = Eel (8.25) 

7=1 

If Eqs. (8.22) and (8.25) are written for two different characteristic values, E and 
E, corresponding to trial functions if/ and if/, it is easy to see that 

a n n n 

2 2 c t *(i\H\j)cj = E 2 c*c t 

<= i j= i ,= i 

and 

n n n 

2 2 Cj*(j\H\k)c k = E 2 C k *c k 

j—\ *= i k= i 

from which by subtraction it follows that 

n 

(E- E) 2 c,* C| = 0 

i- I 


Hence, the optimal Rayleigh-Ritz trial functions corresponding to different charac- 
teristic values are orthogonal: 


(8.26) 


Since the system of equations (8.22) is linear and homogeneous, any linear combi- 
nation of two or more optimal trial functions belonging to the same characteristic 
value E is also an optimal trial function. Hence, as was shown in Chapter 4, if the 
optimal trial functions are not initially orthogonal, they may be “orthogonalized” 
and replaced by an equivalent set of orthogonal functions. It is important to note 
that all of these results pertain to the approximate solutions of the Schrodinger 
equation, whereas we already know them to be true for the exact solutions. 

Exercise 8.5. Show that if the n optimal orthonormal trial functions, obtained 
by the Rayleigh-Ritz method, are used as basis functions, the matrix elements of H 
are diagonal: 

(i\H\j) = E^Sy (8.27) 


In order to help us appreciate the significance and utility of the n characteristic 
values E produced by the Rayleigh-Ritz method, let us assume that the roots of the 
determinantal equation (8.24) have been ordered so that 

£|s^s...s^s E' n 


The corresponding n optimal trial functions may be assumed to constitute an or- 
thonormal set. If this orthonormal set is used as a basis, (8.27) shows that the de- 
terminantal equation takes the simple form 


DAE) 


E[-E 0 
0 E' 2 -E 

0 0 


0 

0 

K-E 


(8.28) 
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Suppose that an (n+ l)-st basis function is added, which may be chosen to be or- 
thogonal to the previous n basis functions. In this new basis, the determinantal equa- 
tion is also simple: 


D n+l (E ) = 


E\-E 

0 


0 

e' 2 -e 


(l\H\n + 1) 
<2|tf|n+l> 


= 0 (8.29) 


(n+l\H\l) <n+l|tf|2> 


(n+\\H\n+l)-E 


From this equation a new set of n+1 roots or characteristic values can be calculated. 
The values of the determinant D n +i (E) for E = E' r and E = E' r+X can be shown 
to have opposite signs, provided that^+i =£ E' r . Hence, between £' and^+i there 
must be at least one root, a characteristic value. From the behavior of the determinant 
D n +i(E) for E -*■ ± oo, it can be seen that there must also be one characteristic value 
that is <E\ and one that is E' n . Hence, since there are exactly n+ 1 roots, each 
interval between E' r and £' + 1 must contain one new characteristic value, and the two 
remaining characteristic values lie below and above the old spectrum. If 
E' r = E' r+l , a new root of D n +1 (E) coincides with these two previous characteristic 
values, but generally it need not be a repeated zero of D n + i(E). 


Exercise 8.6. If the roots of D 3 (E) = 0 are distinct and ordered as 
E[ < E' 2 < E 3 , show that D 4 (E{) and D 4 {E 3 ) are negative and D 4 (E 2 ) is positive. 
What can you infer about the roots of D 4 (E) — 01 Try to discern a pattern. 


Since the Hamiltonian is an operator with a lower bound for its expectation 
values, it follows that E[ lies above the ground state energy, E' 2 lies above the first 
excited level, E 3 is above the second excited level, and so on. As a new basis function 
is added, all the new variational estimates E" r will move down. Figure 8.2 illustrates 
the situation. 

If in the limit as n — » °° the basis set approaches a complete set of linearly 
independent functions for the domain of H, in the se'nse of convergence “in the 
mean,” the optimal trial functions approach the true eigenfunctions of the Hamil- 
tonian operator. The determinantal characteristic equation becomes infinitely di- 
mensional, and as n is increased its roots approach the eigenvalues of H from above. 

Thus, in practice, the Rayleigh-Ritz method provides a way of estimating upper 
bounds for the lowest n eigenvalues of the energy operator H, and the bounds are 
steadily improved by choosing ever larger values of n. There is no assurance that 
the convergence will be rapid; in fact, generally it will be slow unless the trial 
function can be made to resemble the actual eigenfunctions closely. An application 
of the Rayleigh-Ritz is afforded by perturbation theory for degenerate or near- 
degenerate states in the next section. 

The equations derived in this section for the approximate determination of en- 
ergy eigenvalues and the corresponding eigenfunctions are all familiar from linear 
algebra and can be written in matrix form. We will return to the matrix representation 
in Chapter 9, but the present discussion already shows how to link Schrodinger’s 
wave mechanics with Heisenberg’s matrix mechanics. 

3. Perturbation Theory of the Schrodinger Equation. In every physical theory 
two trends are evident. On the one hand, we strive to formulate exact laws and 
equations that govern the phenomena; on the other hand, we are confronted with the 
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E 


2 



E i 



E Z 


Ex 

(ground 

D n (E') = 0 D n+l (E") = 0 State) 

Figure 8.2. Diagram showing schematically the location of the roots of the characteristic 
determinantal equations D n {E') — 0 and D n+1 (E") = 0 in relation to the exact eigenvalues 
E r of the (r-l)-st excited energy level of the Hamiltonian H, which the approximate 
eigenvalues approach from above. When the (n+l)-st basis function is added, the 
degeneracy E l = E\ is removed and a new approximate eigenvalue, E'„ + u appears. 


need to obtain more or less approximate solutions to the equations, because rigorous 
solutions can usually be found only for oversimplified models of the physical situ- 
ation. These are nevertheless useful, because they often serve as a starting point for 
approximate solutions to the complicated equations of the actual system. 

In quantum mechanics, the perturbation theories are examples of this approach. 
Given a complex physical system, we choose, if possible, a similar but simpler 
comparison system whose quantum equations can be solved exactly. The compli- 
cated actual system may then often be described to good approximation in terms of 
the cognate system. 

Examples of the uses of perturbation theory are legion. For instance, in atomic 
physics the problem of the motion of an electron in a Coulomb field can be solved 
rigorously (Chapter 12), and we may regard the motion of an electron in a real many- 
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electron atom as approximated by this simpler motion, perturbed by the interaction 
with the other electrons. The change of the energy levels of an atom in an applied 
electric field can be calculated by treating the field as an added perturbation, and 
the influence of an anharmonic term in the potential energy of an oscillator on the 
energy spectrum can be assessed by perturbation theory. In a large class of practical 
problems in the quantum domain, perturbation theory provides at least a first qual- 
itative orientation, even where its quantitative results may be inaccurate. This section 
is our first introduction to perturbation theory for stationary states. A more system- 
atic treatment, including the calculation of higher-order approximations, will be 
given in Chapter 18. 

Suppose that a Schrodinger equation 

h 2 

- — V 2 i// + = Eift (8.30) 

2m 

with an unmanageable potential energy V x is given, but that we know the normalized 
solutions of a similar Schrodinger equation 

- — V 2 i// 0) + V 0 d/ (0) = E m ip m (8.31) 

2m 


with Vi differing from V 0 by a small amount V = V) — V 0 - If V is small (in a sense 
yet to be made precise), we intuitively expect that the eigenvalues E (0) of (8.31) will 
be reasonably close to the energy eigenvalues E of (8.30), and that the corresponding 
eigenfunctions i (f (0) will be similar to if/. 

If the unperturbed eigenfunction is used as a trial function and if £ (0) is 
nondegenerate , the variational expression 


K 2 

-S v +v ' 


(H) = J 

= E m + J t// (0) *Vif/ (0) d 


^(0) d s r = j ^(°>*(// 0 + V )^(0) £ 


(8.32) 


represents a sensible, and often quite accurate, estimate of the energy eigenvalue of 
the Schrodinger equation (8.30). The difference potential V is called a perturbation 
of the unperturbed potential energy V 0 , and the change in the energy eigenvalue is 


A E = E- E (0) 


J t/, (0) *Vif/ (0) d 3 


(8.33) 


which is one of the most useful formulas in quantum mechanics. 

If E (0) in (8.31) is degenerate and corresponds to n linearly independent unper- 
turbed eigenfunctions, ^ 0> ^ 0) , the Rayleigh-Ritz method of the last sec- 

tion can be used to calculate a variational estimate of the perturbed energy. Fre- 
quently, this method is also useful if the n unperturbed energies, rather than being 
all equal as in the case of degeneracy, are merely so close to each other that the 
perturbation is effective in “mixing” them. 

We assume that the n unperturbed degenerate or near-degenerate eigenfunctions 
are orthonormal. The matrix elements (8.18) of the Hamiltonian H = H 0 + V now 
take the form 


<i\H\j) = Ef 8 tj + (i\V\j) 


(8.34) 
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The determinantal equation for the estimated energy E is 

det[(£< 0> - E)8 ij + < i\V\j >] - 0 


(8.35) 


The n roots of this (secular) equation are approximate perturbed energy values. 

As an example, we suppose that two particular eigenvalues £) 0) and Ef } of H 0 
are close to each other, or degenerate (in which case E ') 0) = E^), and belong to two 
orthonormal eigenfunctions if/^ and i/4 0) , which will serve as basis functions. Equa- 
tions (8.35) turn^for this case into 


<1 1 V[ 1) + E\ 0) - E (1 1 V|2> 

<2 1 Vj 1) (2\V\2) + Ef y - E 


(8.36) 


with roots 


E U2 = ^ [M 0) + E$> + <1|V|1) + <2| V|2> 


± V(^ 0) + (2 1 V|2) - - <l|Vjl» 2 + 4|(l|y|2)| 2 ] 


(8.37) 


We consider two limiting cases in more detail. 

(a) Here we assume that the unperturbed energies E ) 0) and E are sufficiently 
separated so that 


| E^ + (2|y|2) - M 0) - <l|v|l)| » |(1 1 V|2)| (8.38) 


The magnitude of the “mixing” matrix element (1 1 V\2) is small compared with the 
difference between the first-order perturbed energies. In this case, we can expand 
(8.37) and obtain 


E 


1,2 


M 0) + <1 ! v| 1 ) 

ET + <2 1 V|2) 


|<llV|2)| 2 

Ef> + (2\V\2) ~ M 0) - <1|V|1) 

i<lM2)j 2 

Ef + (2 1 V|2> - Ef ] - <1|V|1> 


(8.39) 


This result shows that for the last term to be negligible and the first-order estimate 
(8.33) for the perturbed energy to be reliable, the perturbation V must be weak and 
the other unperturbed energy levels must be far away on the energy scale. 

(b) In the opposite limit of exact degeneracy of the unperturbed levels, 
Ef } = E^ 0) = E (0) , and (8.37) reduces to 


E = E (0) + ^ [<1|V|1> + (2 1 V\ 2) ± V«1|V|1) - <2|V|2» 2 + 4[<1 1 V|2>| 2 ] 


(8.40) 


Corresponding to these energies, the simultaneous linear equations (8.22) reduce to 


/<1|F|1> <1|V|2>\/cA 
(<2| V| 1> (2| V|2)/\c 2 / 


(E - E (0) ) 



(8.41) 


The amplitudes c 1 and c 2 define the optimal trial functions 

<A= + c 2 </4 0) (8.42) 

Unless (1 ] V| 1) = <2 1 V| 2) and (ljy|2) = <2 j V| 1) = 0, the perturbation splits the 
degenerate levels and the two optimal trial functions are orthogonal. Two states 
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i/4 0) and i/4 0) , which are connected by a nonvanishing perturbation matrix element 
<1 \V\2), are sometimes said to interact with each other. 

Exercise 8.7. If the perturbation affecting two degenerate unperturbed states 
is such that (1 J V| 1) = (2| V|2), show that [ | = \c 2 \ and that the relative phase of 
the amplitudes Ci and c 2 depends on the phase of {1 \V\2). 

Exercise 8.8. Assume that the unperturbed Hamiltonian has an n-fold degen- 
erate energy eigenvalue E (0) = 0, that all diagonal matrix elements (k\V\k) = 0 
(k = 1, 2, . . . , n), and that all off-diagonal matrix elements (&|V|€) ( k =£ €) are 
equal to a real negative value —v (v > 0). Show that the degeneracy is partially 
removed and that the perturbed (ground) state has energy E — — (n — l)v, while all 
the others occur at E = v. Also show that the amplitudes, which define the ground 
state trial function, all have equal magnitudes and phases. 


4. The Rayleigh-Ritz Method with Nonorthogonal Basis Functions. In Section 
8.2, the basis for the Rayleigh-Ritz trial functions was assumed to be orthonormal. 
This is not necessary and not always desirable. Instead, we may choose basis func- 
tions Xi that are neither orthogonal nor necessarily normalized and again consider 
trial functions of the form 


<A(r) = 


X 
1= 1 


c' *i(r) 


(8.43) 


The index i — 1, 2, . . . , n on the undetermined coefficients c‘ has been elevated to 
a superscript for purposes of this section only to emphasize the geometrical signif- 
icance of vector representations like (8.43). 

It is convenient to introduce a second related basis ^(r) in this n-dimensional 
vector space such that 


X J W = 2 Xi(r)g iJ 

i=i 


(8.44) 


and 



J X J *(r)Xt(X)d 3 r = 8j 

(8.45) 

where 8 j t is the Kronecker delta: equal to one if i = j 
From (8.44) and (8.45) it follows that 

and zero otherwise. 


g ij = J x‘*X J d 3y = 8 ji * 

(8.46) 

The g ij can be taken to be the elements of an n X n matrix (with the first superscript 
labeling the rows and the second one the columns). A matrix for which (8.46) holds 
is said to be Hermitian. Similarly, if we define a second Hermitian matrix, 

■ 

S„ = J X*iXj <fr - g*f 

(8.47) 
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we obtain, by substitution of (8.44) into (8.45), the connection 

(8.48) 

If it is assumed that the determinant of the matrix g ki does not vanish, Eq. (8.48) 
can be used to calculate the coefficients g jk . The two Hermitian matrices are inverses 
of each other. 

The expressions (8.46) and (8.47) are said to be overlap integrals, because for 
i + j their values tell us how nonorthogonal the basis functions are. Equation (8.44) 
can now also be inverted: 

n 

x, = I x‘u (8.49) 

1=1 

and an alternative expansion for (8.43) can be derived: 

n n n 

<Kr) = 2 2 ci X j gji = 2 c jX J ( r ) (8.50) 

>— i j= i j= i 

provided we identify 

(8.51) 

On account of the condition (8.45), the new basis x j is called reciprocal to the 
basis Xi- 

If the basis Xi is orthonormal, then the reciprocal basis coincides with it, and 
we have x‘ = Xi and gy = g ij — Sy- Such a basis is self -reciprocal. \ 

The formalism developed in this section is isomorphic to the geometry of an 
n-dimensional vector space. The use of the g' s to raise and lower indices by sum- 
mation over repeated adjacent indices — one upper and one lower — and the matching 
of indices on two sides of an equation are the notational means by which the theory 
is expressed concisely and conveniently. All the relations and rules we have estab- 
lished have the same form independent of the choice of equivalent basis functions, 
provided only that these span the same space of trial functions (8.43). For more 
details, we refer to the standard mathematical literature. 1 

Exercise 8.9. On the interval x = — o° to + °° the nonorthogonal basis func- 
tions Xk( x ) = x k e~\ x \ are introduced with k — 1,2 , ... ,n. For n — 3 and 4, construct 
the reciprocal bases. 

The calculation of the variational expectation value (H) requires evaluation of 
the matrix elements 

H„ = j xlHx, cPr (8.52) 




‘Coxeter (1969), Chapter 18. 
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or the related matrix elements 

H iJ = J X ‘*Hx J d 3 r, H‘ j = J X l *H Xj d 3 r, Hj = j X lH X j d 3 r (8.53) 

Exercise 8.10. Show that any one of the matrix elements defined in (8.52) and 
(8.53) can be expressed as a linear combination of any of the others, e.g., 

H ij = E g*H> = 2 S g ie H (k g kj 

€ = 1 e=i k=\ 

Thus, the g’s serve to raise and lower indices. 

Exercise 8.11. Show that the overlap integrals are the matrix elements of the 
identity operator in the nonorthogonal basis. 

We now implement the variational principle for the functional (H) by varying 
the n components c ; - and c‘* independently. As in Section 8.2, where the basis func- 
tions were orthonormal, we obtain 

n 

2 H)c J = (H)c‘ (8.54) 

7=1 

or their equivalent partners, obtained by raising or lowering indices, such as: 


2 H kj c J = (H) 2 gkic 1 

7=1 


(8.55) 


i= 1 


If we again use the notation ( H ) — E, these systems of equations can also be written 
as 


(8.56) 



and 



(8.57) 


It is a matter of taste which of these sets of homogeneous linear equations one prefers 
to solve. If the basis is orthonormal and thus self-reciprocal, these equations all 
reduce to a single system. The system (8.56) is similar to the simultaneous equations 
(8.23) in Section 8.2. However the matrix H l j is generally not a Hermitian matrix 
because 


r* — I 

H‘, = j x ‘*H Xi d 3 r = | XjWxWr I = H/* 
and this generally does not equal H j t * (except if the basis is orthogonal). 


(8.58) 


Exercise 8.12. Derive the n simultaneous equations (8.57) for the variational 
parameters c' from the condition 8(H) = 0. 


The system (8.57) contains Hermitian matrices, but the overlap integral matrix 
g appears with the eigenvalue. Both (8.56) and (8.57) give the same characteristic 
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values and the same optimal trial functions. The characteristic values are the n roots 
of the determinantal equations 


det (Wj - E8‘j) = 0 

(8.59) 

det (Hy - Egy) = 0 

(8.60) 


Exercise 8.13. From (8.55) deduce another convenient form of the variational 
equation: 

n 

2 H k j Cj = Ec k (8.61) 

7=1 

and prove that the optimal trial functions belonging to two different roots of the 
characteristic equation are orthogonal. 

Exercise 8.14. Show explicitly that the determinantal equations (8.59) and 
(8.60) have the same roots. 

This brief introduction to nonorthogonal basis functions draws attention to a 
variational technique that has wide applicability. Nonorthogonal basis functions are 
particularly useful and popular in molecular physics and quantum chemistry. An 
elementary example in the next section will serve as an illustration. 

5 . The Double Oscillator. As a further illustration of approximation methods, 
we now supplement the discussion of the simple harmonic oscillator (Chapter 5) by 
study of a more complicated potential, pieced together from two harmonic oscilla- 
tors. The example comes from molecular physics. There we frequently encounter 
motion in the neighborhood of a stable equilibrium configuration, approximated by 
a harmonic potential. To be sure, one-dimensional models are of limited utility. Even 
diatomic molecules rotate in space, besides vibrating along a straight line. Never- 
theless, important qualitative features can be exhibited with a linear model, and some 
quantitative estimates can also be obtained. 

Let us consider, as a model, two masses m x and m 2 , constrained to move in a 
straight line and connected with each other by an elastic spring whose length 
at equilibrium is a. If and x 2 are the coordinates of the two mass points, and 
Pi and p 2 their momenta, it is shown in classical mechanics that the nonrelativistic 
two-body problem can be separated into the motion of the center of mass and an 
equivalent one-body motion, executed by a fictitious particle of mass 
m = m 1 m 2 /(m 1 + m 2 ) with a coordinate x — x x — x 2 about a fixed center under the 
action of the elastic force. The correspondence principle suggests that these general 
dynamical features of any two-particle system, which is subject only to an interaction 
potential between the two particles depending on their relative coordinates, survive 
the transition to quantum mechanics. Only the relative motion of the reduced mass 
m will be considered in this section. A full discussion of the separation of the relative 
motion from the motion of the center of mass will be given in Section 15.4. 

The potential representing the relative motion of the reduced mass m is 
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V{x) 



Figure 8.3. The double oscillator potential V(x ) = mo) 2 {\x\—a) 2 l2. The outer classical 
turning points are at x L and x R . If E < V 0 , there are also two inner turning points, but the 
particle can tunnel through the barrier. 


If a = 0, (8.62) reduces to the potential for the simple linear harmonic oscillator. If 
a ^ 0, it is almost the equation of a harmonic oscillator whose equilibrium position 
is shifted by an amount a, but it is not quite that. For it is important to note that the 
absolute value of jc appears in the potential energy (8.62), because Hooke’s law is 
assumed to hold for all values of the interparticle coordinate x. As shown in Figure 
8.3, there are two equilibrium positions (jc = ±a), and we have two parabolic po- 
tentials, one when particle 1 is to the right of particle 2 (x > 0), and the other when 
the particles are in reverse order (jc < 0). The two parabolas are joined at jc = 0 
with the common value V(0) = V 0 = m(o 2 a 2 /2. Classically,, if E < V 0 , we can assume 
that only one of these potential wells is present, for no penetration of the barrier is 
possible. In quantum mechanics the barrier can be penetrated. Even if E < V 0 , the 
wave functions may have a finite value at jc = 0, which measures the probability 
that particles 1 and 2 are found in the same place. 

The wave equation corresponding to the equivalent one-body problem is 


ih 


#(*, 0 

dt 


h 2 d 2 if/(x, t) 
2m dx 2 


+ ]- mct) 2 ( |jc| — a) 2 i^(jc, t) 


(8.63) 


and the Schrodinger equation is 

~ + A ma) 2 (\x\ - a) 2 ij/(x) = Eij/(x) (8.64) 

2m dx 2 


For | jc | » a, (8.64) approaches the Schrodinger equation for the simple harmonic 
oscillator; hence, the physically acceptable eigenfunctions must be required to van- 
ish as | jc | — * °°. 

Before attempting to solve (8.64), we note that as the parameter a is varied from 
0 to +oo, the potential changes from the limit (I) of a single harmonic oscillator 
well (of frequency of) to the other limit (II) of two separate oscillator wells (also of 
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frequency <w), divided by an infinitely high and broad potential barrier. In case I, we 
have nondegenerate energy eigenvalues 


E = 


hoi 



(n = 0,1,2,.. .) 


(8.65) 


In case II, the energy values are the same as those given by (8.65), but each is doubly 
degenerate, since the system may occupy an eigenstate at either the harmonic os- 
cillator well on the left or the one on the right. 

The potential energy for the double well, V(x), is an even function and invariant 
under x reflection. The eigenstates have even or odd parity. The probability distri- 
bution for every energy eigenstate is symmetric about the origin, and in these states 
there is equal probability of finding the system in either of the two potential wells. 
If the state is to favor one side of the potential over the other, we must superpose 
even (symmetric) and odd (antisymmetric) stationary states. The superposition is 
generally not stationary. 

As the limit of case II (a — » °°) is approached, however, the two degenerate 
ground state wave functions are concentrated in the separate wells and do not have 
definite parity. Thus, the reflection symmetry of the double well is said to be hidden, 
or broken spontaneously by the ground state energy eigenfunctions, without any 
external influences. Case II serves to illustrate the concept of spontaneous symmetry 
breaking which arises in many physical systems, particularly in quantum field theory 
and many-body physics. 

As a is varied continuously, energies and eigenfunctions change from case I to 
case II. It is customary to call this kind of continuous variation of an external pa- 
rameter an adiabatic change of the system, because these changes describe the re- 
sponse of the system in time to changes of the external parameter performed infin- 
itesimally slowly. As the potential is being distorted continuously, certain features 
of the eigenfunctions remain unaltered. Such stable features are known as adiabatic 
invariants. 

An example of an adiabatic invariant is provided by the number of zeros, or 
nodes, of the eigenfunctions. If an eigenfunction has n nodes, as the eigenfunction 
of potential I belonging to the eigenvalue E n does, this number cannot change in 
the course of the transition to II. We prove this assertion in two steps: 

(a) No two adjacent nodes disappear by coalescing as a changes, nor can new 
nodes ever be created by the inverse process. If two nodes did coalesce at x = x 0 , 
the extremum of if/ between them would also have to coincide with the nodes. Hence, 
both if/ and its first derivative if '/' would vanish at this point. By the differential 
equation (8.64), this would imply that all higher derivatives also vanish at this point. 
But a function all of whose derivatives vanish at a point can only be if) = 0. The 
presence of isolated singularities in V (as at x = 0 for the double oscillator) does 
not affect this conclusion, since iff and if/' must be continuous. 

(b) No node can wander off to or in from infinity as a changes. To show this, 
we only need to prove that all nodes are confined to the region between the two 
extreme classical turning points of the motion, i.e., the region between x L and x R in 
Figure 8.3. Classically, the coordinate of a particle with energy E is restricted by 
x L s x ^ x R , where x L and x R are the smallest and largest roots, respectively, of 
V(x) = E. From Schrodinger’s equation we infer that 


if/" 

7 


2m 

7 


= 77 [V(x) - E] 


( 8 . 66 ) 
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Then the expression on the right is positive definite in the classically inaccessible 
region. If there were a node in the region x < x L , it would have to be a point of 
inflection for the function «/>(■*). Conversely, a point of inflection would also have 
to be a node. The existence of such a point is incompatible with the asymptotic 
condition: fa(x) -» 0 as x —> - °° . The same reasoning holds for the region x^x R . 
It follows that outside the classical region there can be no node and no extremum 
(see Figure 6.3). 

Being an adiabatic invariant, the number of nodes n characterizes the eigen- 
functions of the double oscillator for any finite value of a. Figure 8.4 shows this for 
the two lowest energy eigenvalues ( n — 0, 1). 

The two eigenfunctions i}/ 0 and fa correspond to E — fiu>! 2 and 3h<x>/2, if a = 0 
(case I). For a — » °° (case II), they become degenerate with the common energy 
fiotH. When a is very large, the linear combinations i jj 0 + fa and i f/ Q — fa are 
approximate eigenfunctions corresponding to the wave function being concentrated 
at x = +a and at x = —a, respectively; the reflection symmetry is broken. 

If a is finite but still large compared with the characteristic amplitude of the 
two separate oscillators, so that 



the harmonic oscillator eigenfunctions fa (x — a) and fa (x + a), obtained from 
(5.27) by shifting the coordinate origin, may be used as basis functions for a vari- 
ational calculation of energy eigenvalues of the Hamiltonian 

2 i 

H = ~ + - ma> 2 {\x\ — a) 2 (8.68) 





Figure 8.4. The two lowest energy eigenfunctions fa and fa for the double oscillator. In 
case I (a = 0) we have a simple harmonic oscilator, no barrier, and E 1 ~ E 0 ~ hco. In case 
II (S/lmodh a = 4), a high barrier separates the wells, and E x - £ 0 == 10~ 3 hco. The 
number of nodes characterizes the eigenfunctions. 
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at least for not too large values of n. Since the exact eigenfunctions of H, for finite 
a, have definite parity, reflection symmetry suggests that the even and odd trial 
functions 


ip± = N ± [ip n (x - a) ± ijj n (x + a)] 


(8.69) 


be used. Here N ± is a normalization constant. Except in the limit of case II (a -*■ °°), 
the two components in (8.69) are not orthogonal; they overlap. For even n, the plus 
sign gives an even (or symmetric) function, and the minus sign an odd (or antisym- 
metric) function. These assignments are reversed for odd n. Since the even operator 
H does not connect states of opposite parity, so that ( even\H\odd ) = 0, it is opti- 
mally efficient to choose tp± as trial functions. Linear combinations of these, a la 
Rayleigh-Ritz, give nothing new. For the normalized trial functions, the variational 
estimate of energies of the double oscillator is then 

(H) = | d 3 r 

If we substitute (8.69) into this formula, we obtain for the real-valued oscillator 
eigenfunctions, 


where 


(H) = 


± B n 
1 ± C„ 


(8.70) 


A n — j ijj n (x — a)Hif/ n {x ~ a) dx — j <//„ (x + a)Hip n (x + a) dx 
B n = J if/ n (x — a)H\Jj n (x + a) dx = J <//„ {x + a)Hi}/ n (x — a) dx 
and C„ is the overlap integral 

C„ = j tjr n (x + a)^ n (x — a) dx 


Also 


AL = 


1 


V2(l ± C n ) 
Using the scaled dimensionless quantity, 

Imo) 


a = 



a 


(8.71) 

(8.72) 

(8.73) 

(8.74) 

(8.75) 


we have for n = 0, 


A 0 = ho) 


1 2a 

2 “s/tt 









2 \frr a 




and for large a: 
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Similarly, 


B 0 — hco 

For a » 1, the leading terms give A 0 = hool 2, B 0 
and thus 


a 


Vtt 2 


and 


C 0 = e~ a 

= — (a/Vir) e~ a2 hco, C 0 = 0, 


(H) « (8-76) 

showing that the degenerate ground state for a -> °o splits symmetrically as a de- 
creases. The even state is the ground state, and the odd state is the first excited state. 
If the height of the potential barrier between the two wells, 

V 0 — ~ mco 2 a 2 — ^ hcoa 2 (8.77) 

is introduced, the splitting of the two lowest energy levels, for the double oscillator 
with V 0 » hco, is 


£_ - E + 


A E = 2hco 


2V 0 

rrhco 



2 Vo\ 
hco ) 


(8.78) 


Exercise 8.15. Work out the above expressions for A 0 , B 0 , and C 0 for the 
double oscillator (8.68), and derive the asymptotic value of ( H ) as given in (8.76). 


The frequency corresponding to the energy splitting (8.78) is 

A E 

T 




— 2(0 


2V 0 


fl(OTT 


exp 


2Vq 


h(o 


(8.79) 


The physical significance of the frequency co s is best appreciated if we consider a 
system that is initially represented by a wave function (again assuming n — 0) 


0) = [t ft+(x) + <A-(x)] 


(8.80) 


If the two wells are widely separated ( a » 1), so that the overlap integral C 0 is 
very small, i{/+ and i/>_ tend to cancel for x < 0 and the initial wave packet is a 
single peak around x= +a. If x = x r — x 2 is the coordinate difference for two elas- 
tically bound particles, as described at the beginning of this section, the initial con- 
dition (8.80) implies that at t = 0 particle 1 is definitely to the right of particle 2, 
breaking the reflection symmetry. The time development of the system is then 


Hx, ,) = 2- 


— exp 


exp I — t E + t\^+(x) + exp -- E~t 


h 




a 


t^j <pjx) 


<A+ + ij/_ 


<0* 


. lj/ + ~ l/f_ . (0 


r- cos — t + i — — 7^ — sin — t 

V2 2 V2 2 


(8.81) 


rhis last form shows that the system shuttles back and forth between 
;^+ + (A_)/V2 and — i/^_)/V2 with the frequency <o s /2. It takes a time t s = ttI(o s 
for the system to change from its initial state (i^ + + i^_)/V^ to the state 
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— i/r_)/V2, which means that the particles have exchanged their places in an 
entirely unclassical fashion, and the wave function is now peaked around x = —a. 
Since E was assumed to be much less than V 0 , this exchange requires that the system 
tunnel with frequency a> s /2 through the classically inaccessible barrier region indi- 
cated in Figure 8.3. The ratio of the time r s during which the exchange takes place 
and the period 27 r/<w of the harmonic oscillator is given approximately by 


& 


(O T s _ 0) 

2tt 2 o) s 


1 jfuorr 
4V~2V^ 



(8.82) 


If the barrier V 0 is high compared to h<o, the tunneling is strongly inhibited by the 
presence of the exponential factor. 


Exercise 8.16. Show explicitly that the expectation value of the parity oper- 
ator for the wave packet (8.81) is zero at all times. [The parity operator changes 
if/(x ) into 


The situation is similar for the higher energy levels (Fig. 8.5). Asymptotically, 
at a — > oo, the doubly degenerate spectrum is E - ho)(n + 1/2). As the separation 
a of the two wells decreases from oo to 0, the lower (even) energy level in each split 
pair of levels first decreases and eventually increases toward E — h(o(2n + 1/2). 


E_ 

hu> 



lowest energy eigenstates of the double oscillator, V(jr) = ma> 2 (\x\ - a) 2 ! 2. For comparison 
with the energy levels, the dashed curve shows the barrier height V 0 . 
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The upper (odd) level increases monotonically toward E = hw(2n + 3/2). To obtain 
the energy eigenvalues for an arbitrary value of a, the Schrodinger equation (8.64) 
must be solved accurately. 

The exact solution of (8.64) is facilitated by substituting for positive x the new 
variable, 


z = 



( x — a) 


(x > 0) 


and expressing the energy as 


E = 



v + 



(8.83) 


in terms of the new quantum number v, which is generally not an integer. We obtain 
for positive x the differential equation 


d 2 if/ 

dz 2 


+ 



(8.84) 


For negative x the same equation is obtained, except that now the substitution 


z = 



(x + a) 


(x < 0) 


must be used. The differential equation valid for negative x is 

d 2 ij/ ( 1 z' 2 \ 

^ + v + 2-t r=° (8 - 85) 

which has the same form as (8.84). The boundary condition ^ 0 as i ±oo 

implies that we must seek solutions of (8.84) and (8.85) which vanish as z —> +°° 
and z' — » — 00 • For a — 0, which is the special case of the simple linear harmonic 
oscillator, we have z' = z, and the two equations become .identical. 

Instead of proceeding to a detailed treatment of differential equation (8.84), we 
refer to the standard treatises on mathematical analysis for the solutions. 2 The par- 
ticular solution of (8.84) which vanishes for very large positive values of z is called 
a parabolic cylinder function. It is denoted by D v (z ) and is defined as 


D v (z) = 


2^/2 e ~z 2 /A 


rq/2) _v l zf 

r[(l - V)I2) v \ 2 ’ 2 ’ 2 ; 


+ 


z T(— 1/2) 



V2 r(- v/2) 


( 8 . 86 ) 


The function 1 F 1 is the confluent hypergeometric (or Rummer ) function. Its power 
series expansion is 


iFj.(a; c; z) = 1 + - 7- + 

c 1! 


a(a + 1) z 1 
c(c + 1) 2! 


(8.87) 


^Magnus and Oberhettinger (1949), Morse and Feshbach (1953), Abramowitz and Stegun (1964), 
Vlathews and Walker (1964), Thompson (1997). 
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If z is large and positive ( z » 1 and z » | v|) 


D v (z) = e 


-z / 4 


J _ y(y - 1) + v{v - l)(^ - 2)(v - 3) 


2 • z 2 ’ 2 • 4 • z 4 

and if z is large and negative (z « - 1 and z « -| v|), 


D v (z) = e 


-z /4 „v 


1 _ y(y ~ 1) + v{v - l)(v - 2){v - 3) 


2 ■ z 2 


\Z2tt 

T(-v) 


e VTTi e Z 2 H z -v-l 


1 + 


2 • 4 • Z 4 

o + l)(i> + 2) 


+ 


2 • z 2 

( v + 1)Q + 2)Q + 3){v + 4) 
2 • 4 • z 4 


+ 


( 8 . 88 ) 


(8.89) 


Although the series in the brackets all diverge for any finite value of z, (8.88) and 
(8.89) are useful asymptotic expansions of D v (z). 


Exercise 8.17. Using the identity 

T(1 + v)T(-v) = ~ ~~~ (8.90) 

sin VTT 

obtain the eigenvalues of the simple harmonic oscillator from (8.89). Show the con- 
nection between the parabolic cylinder functions (8.86) and the eigenfunctions 
(5.27). 


If D v (z) is a solution of (8.84), D v (~z) is also a solution of the same equation, 
and these two solutions are linearly independent unless v is a nonnegative integer. 
Inspection of the asymptotic behavior shows that the particular solution of (8.85) 
which vanishes for very large negative values of z' must be D v (— z'). It follows that 
a double oscillator eigenfunction must be proportional to D v (z) for positive values 
of x and proportional to D v {~z') for negative values of x. The remaining task is to 
join these two solutions smoothly at x = 0, the point where the two parabolic po- 
tentials meet with discontinuous slope. 

As was discussed earlier, the eigenfunctions can be assumed to have definite 
parity, even or odd. The smooth joining condition requires that at x = 0 the even 
functions have zero slope and the odd functions must vanish. Matching < f/ and i f)' at 
x = 0 thus leads to 



for even ip, and 



(8.91) 


(8.92) 


for odd i fj. These are transcendental equations for v. 


Exercise 8.18. Show that if a — 0, the roots of (8.91) and (8.92) give the 
eigenvalues of the simple harmonic oscillator. [Use the properties of the gamma 
function embodied in the identity (8.90).] 
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In general, the roots v of (8.91) and (8.92) are obtained by numerical compu- 
tation. Figure 8.5 shows how the lowest eigenvalues depend on the parameter a 
defined in (8.75). A few of the corresponding eigenfunctions are plotted in Figure 
8.4. The unnormalized eigenfunctions are 

r 


= 



(x — a) 


(. x + a) 


(X 2= 0) 


(x < 0) 


(8.93) 


where the upper sign is to be used if v is a root of D' v (— V2 a) = 0, and the lower 
sign if v is a root of D v {— V2 a ) = 0. 


Exercise 8.19. A convenient representation of a different anharmonic double 
oscillator is the potential (Figure 8.6): 

VQc) = 0 “ a f( x + a f = ITT (* 2 ~ « 2 ) 2 (8.94) 

8 a 8 a 

For large values of the distance between the minima, estimate the splitting of the 
two lowest energy levels. 


Two (or more) potential wells separated by a potential barrier as in (8.62) and 
(8.94) occur frequently in many branches of physics. The dynamics of the vibrations 
of atoms in molecules is the most prominent example. Hindered rotations of a mol- 
ecule, described by an angle-dependent potential such as 

V(d) = Vo(cos 26 - 1) (V 0 > 0) (8.95) 

also represent typical features of a double-well potential, but the boundary condi- 
tions for the Schrodinger equation are different from those for a potential that de- 
pends on the linear coordinate x. 


V(x) 



Figure 8.6. Quartic double-well potential, defined by Eq. (8.94). 
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As a numerical example of molecular vibrations, consider a molecule in which 
the equilibrium distance between the atoms is a — 1 A, and the reduced mass m — 
1(T 27 kg. For to = 2 X 10 15 Hz (corresponding to an infrared vibration spectrum 
common in many molecules), we have V 0 /h(o ~ 10 3 . From (8.82), the shuttling 
period t s turns out to be superastronomical (about lO 400 years); hence, the exchange 
“hardly ever” takes place. On the other hand, if a) — 10 12 Hz, corresponding to 
microwave vibrations of 0.2 cm wavelength, the value of co s obtained from Figure 
8.5 is approximately 8 X 10 u Hz, which is again in the microwave region. Tran- 
sitions corresponding to such barrier tunneling oscillations are commonly observed 
in microwave spectroscopy. 3 

6 . The Molecular Approximation. Although many physical systems are usefully 
modeled by simple one-particle Hamiltonians of the form 

i p 2 

H = f - + V(r) 

2m 

with a prescribed external potential V(r), this fiction cannot always be maintained. 
In most real physical processes two or more partial systems interact dynamically in 
significant ways. Since it is generally far too difficult to treat such a composite 
system exactly, we must resort to various approximation methods, based on simpli- 
fying features of the system and its components. In this section we consider an 
approximation scheme that yields useful zeroth-order energy eigenfunctions for 
starting perturbation calculations in composite systems where the masses of the in- 
teracting subsystems are grossly different. 

The dynamics of interacting atoms offers a valuable example of a complex 
composite system whose subsystems (the valence electrons and the atomic cores) 
can be treated as if they were autonomous though they interact strongly. Molecular 
structure, ion-atom collisions, and condensed matter physics are among the impor- 
tant applications. Because of the large (three to four orders of magnitude) disparity 
in mass between the electrons and atoms, we think of the atomic cores as moving 
relatively slowly under the influence of their mutual interaction and in the average 
field produced by the fast-moving electrons. Conversely, the electrons respond to 
the nearly static field of the cores, in addition to their own mutual potential energy. 

We consider the Hamiltonian for a model of a molecule composed of two (or 
more) massive atomic cores (nuclei plus electrons in tightly bound inert inner shells) 
and some loosely bound atomic valence electrons that move in the common field of 
the cores. This is indicated schematically in Figure 8.7. 

In the simplest case of two positively charged atomic cores and just one shared 
valence electron, the Hamiltonian may be written as 

(8.96) 

where R and P are the relative canonical coordinates of the slowly moving nuclear 
framework with reduced mass M, and r and p refer to the electron of mass m which 
moves much more swiftly. The potential V(R, r) stands for the mutual interaction 



3 Townes and Schawlow (1955). 
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'Igure 8.7. Two valence electrons (rj and r 2 ) in a molecule moving in the field of three 
eavy atomic cores (R!,R 2 ,R 3 ). The ten interactions between the five centers of force are 
idicated. 


etween the cores and the electrons, and for the Coulomb repulsion between the 
lectrons. The core-core interaction is explicitly accounted for by the potential 
^(R), which is chiefly due to the Coulomb forces between the charges. 

In the limit M -*■ °° the nuclear skeleton is frozen in its position, and the 
lectrons move in the static potential field V(R, r), where the external parameter R 
i fixed. The Schrodinger equation for the (quasi-) stationary states of the electron, 

(8.97) 

orresponds to the subsystem Hamiltonian for the electron: 

H e = ^~+ V(R, r) ' (8.98) 

2m 

In principle, once (8.97) is solved for the complete set of quasi-stationary states 
?r all R, the energy eigenfunction of the complete molecular Hamiltonians (8.96) 
lay be expanded as 

<MR, r) = 2 p (O (R)0^(r) (8.99) 

i 

ubstitution of this expression in the Schrodinger equation for the composite system, 

Htft( R, r) - Eif/(R, r) (8.100) 

ields a set of infinitely many coupled equations for the wave functions i 7 (0 (R). How 
oes one avoid having to cope with such an unwieldy problem? 

The simplest idea is to assume that the eigenstates of (8.100), at least near the 
round state, can be reasonably well represented by single terms in the expansion 
T99) and to use the product 

H R, r) = ri®(R)^r) (8.101) 

dth yet-to-be-determined rj (,) (R), as a trial function in the variational integral (H), 
ased on the complete Hamiltonian (8.96). To keep the discussion focused on the 
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essentials, we assume the solution <f ^(r) of (8.97) to be nondegenerate; otherwise, 
one must work in the subspace of degenerate eigenfunctions of (8.97) as in Section 
8.3, using matrices. The approximate eigenfunction (8.101) is said to constitute an 
adiabatic or Born-Oppenheimer approximation to the exact molecular wave func- 
tion. (The term adiabatic refers to the semiclassical picture of the parameter R 
slowly evolving in time, but our treatment is fully quantum mechanical.) 

The variational procedure requires that we add to the expectation value of the 
electron Hamiltonian, 

(H e ) = j r ? (i) *(R)£®(R)r ? (0 (R) d 3 R (8.102) 

the expectation values of the kinetic and potential energies of the heavy structure. 
The former is 

(01 = V (i) *(R)<f) R*(r) V r| rj (i) (R)4> R (r)] d 3 R d 3 r 


which can be transformed into 

+ hi! [VR ^** (r)] • d 3 


~ A| |i7 (0 (R) d 3 R 


(8.103) 


provided that the electron eigenfunctions <^R(r), for each value of R, and the trial 
functions r/ (l) (R) are normalized to unity. The normalization ensures that the vector 
function A R defined as 


A r = ih J </> R *(r)V R ^R ) (r) d 3 r s (8.104) 

is real-valued. (We omit the label i, on which A R depends.) 

The expectation value of H is obtained by adding (8.102) and (8.103) to the 
expectation value of V„(R) to give 


(H) - j V°*(R) 


2 M 


(P - Ar ) 2 + u (f >m 


p (0 (R) d 3 R 


(8.105) 


where the effective potential £/ (0 (R) is defined as 


U 0) (R) = £ (/) (R) + 



i A » + F ” (R) 


(8.106) 


Applied to (8.105), the variational principle 8(H) = 0 leads to an effective Schro- 
dinger equation for the trial function rj (l) (R): 


2 M 



U a \ R) 


T 7 (,) (R) - Erf\ R) 


(8.107) 
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In solving this equation, care must be taken to ensure that the wave function 
ijj( R, r) for the complete system, of which i 7 (0 (R) is but one factor, satisfies the 
correct boundary (and single-valuedness) conditions. 


Exercise 8.20. Derive (8.103) and verify (8.104). 


The notation A r for the vector field defined by (8.104) was chosen to emphasize 
the formal resemblance of Eq. (8.107) to the Schrodinger equation for a charged 
particle in the presence of an external electromagnetic field (Section 4.6). If, as is 
the case for simple molecules, it is possible to choose the electronic eigenfunction 
(f>$( r) real-valued for all R, the vector A R vanishes. More generally, if 

V R X A R = 0 (8.108) 

(corresponding to zero magnetic field in the electromagnetic analogue), the gauge 
transformation 


rj (!) (R) — > exp I 


<f> rO) 


MR 


exp | 
0 


J* Ar • dRj^KR) 
\ £ Ar • <*rW( r) 


(8.109) 


produces no change in the product wave function t^(R, r) of the composite system, 
but reduces the Schrodinger equation (8.107) to the simpler form 



+ U a \ R) 


t/°(R) = £t7 (,) (R) 


( 8 . 110 ) 


Exercise 8.21. If 0R(r) is a real-valued function for all R, show that A R is 
the gradient of a function and therefore can be eliminated by a gauge transformation 
(8.109). 

If the Born-Oppenheimer method is applied to more complex nonlinear mole- 
cules, with R symbolizing a set of generalized atom core coordinates, the condition 
(8.108) may be violated for isolated singular values of R, requiring the use of (8 . 1 07) 
rather than (8.110). The phase integral 



which is gauge invariant, may acquire different values for various classes of topo- 
logically distinct closed loops in the parameter space. This nonintegrability may 
result in the appearance of characteristic spectral features, not expected from the 
simple form (8.110). The vector A R is known as (Berry’s) connection and the in- 
tegral (8.111) as Berry’s (geometric) phase.* 

The success of any calculation in the Born-Oppenheimer approximation is con- 
tingent on the availability of good solutions 0 R*(r) for the parametric Schrodinger 
equation (8.97) for the electron(s). In quantum chemistry, these solutions are gen- 


4 See Shapere and Wilczek (1989). 
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erally known as molecular orbitals. In practice, (8.97) is again treated as a varia- 
tional problem, and 4> r (r) is represented by trial functions that are linear combi- 
nations of energy eigenfunctions of the hypothetically separated atoms, which 
constitute the molecule (linear combination of atomic orbitals, or LCAO). If the 
constituent atomic orbitals are nonorthogonal, their mutual overlap integrals play an 
important role in the calculations. Many-body methods (Chapter 22) are used to 
include the interaction between elections. From all of this, effective energy eigen- 
values, E (i) (R), for the low-lying electronic states are eventually derived and, with 
the inclusion or neglect of the usually small correction terms in (8.106), the potential 
energy surfaces Z7 (0 (R) are computed as functions of the nuclear configuration co- 
ordinates R. 


Exercise 8.22. Assuming the molecular orbital functions 4>r(t) in the expan- 
sion (8.99) to be orthonormal and real, derive the exact coupled integro-differential 
equations for the nuclear configuration wave functions tj w (R) and show that Eq. 
(8.110) results if certain off-diagonal matrix elements are neglected. 


To illustrate the approximation method just described, we consider the sche- 
matic model of a one-dimensional homonuclear diatomic molecule, which can vi- 
brate but not rotate and in which effectively only one electron is orbiting in the field 
of both atomic cores (Figure 8.8). The H X molecular ion is the simplest example. 
If M is the reduced mass of the molecule, X is the relative coordinate of the two 
nuclei, m is the mass of the electron, and x is the electron coordinate relative to the 
center of mass, the Schrodinger equation for this example may be written as 


h 2 d 2 
2M 6X 2 


ft d 2 

— + V(x, X) + V„(X ) 

2m dx 


if/(x, X) = E4>{x, X) 


( 8 . 112 ) 


provided that the center of mass of the entire molecule is assumed to be at rest. We 
also assume that X > 0 and thus now neglect the possibility of exchange between 
the two identical nuclei on the grounds that the barrier tunneling period r s tends to 
infinity. In effect, we assume that the interaction F(jt, X) is not invariant under the 
nuclear reflection X — > —X. The potential V n (X) represents the repulsive interaction 
between the two atomic cores. 

The electron-core interaction is an electrostatic attraction, but we crudely sim- 
ulate it by another double oscillator potential, 


V(x, X) = 


mto 2 



(8.113) 


with X ~ 10“ 10 m being the separation of the two atomic wells. The characteristic 
frequency of the electron motion in the separated atom is of order to = 5 X 10 15 
Hz. The critical ratio V 0 lh(o = (mw/8ft) X 2 is about 0.05 for this case, indicating a 


r I r 

2 0 2 

Figure 8.8. Model of a one-dimensional one-electron homonuclear diatomic molecule, 
with interatomic distance X. The electron (mass m) interacts with the atomic cores of mass 
m n (and reduced mass M ~ mJ2). The spring symbolizes the binding force between the 
cores, attributable in the Born-Oppenheimer approximation to rapid exchange of the 
electron between the slowly moving atoms. 
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low barrier. The electron motion cannot be localized in just one of the two wells 
and is shared by both of them. (For an even more primitive model, see Problems 3 
and 4 in Chapter 6.) 

The Born-Oppenheimer approximation for the wave function consists of assum- 
ing that the variational trial solution has the approximate simple product form 

>P(x, X) = r,V(X)<f> ( iXx) (8.114) 

as in Eq. (8.101). Equation (8.112) separates approximately into a stationary state 
equation (8.97) for the molecular-orbital electron: 

-£ 5 + *r ( w " I) ]**<* = < 8 - 115 > 

corresponding to a fixed value of the parameter X, and a second Schrodinger equation 
describing the motion of the nuclei in the presence of the potential E { J\X): 

+ jj(o ( x) v (0 (x) = Er f) {x) (8.116) 

IM oX 

In (8.116), the potential is 

U (i KX) = E«XX) + V n (X) + dx (8.117) 

2MJ-co 8X 

since the connection A R vanishes for a one-dimensional system. 

By inspection of the scaled Figure 8.5, we see that the potential function E ( p{X) 
for the two atoms has a minimum corresponding to a stable equilibrium configuration 
if the electron is in one of the symmetric eigenstates. Only in these electronic ei- 
genstates is it possible for the attractive exchange interaction mediated by the shared 
electron to overcome the core-core repulsion, allowing the vibrating atoms to be 
bound in the molecule. In this simplistic model, if the system found itself in an 
antisymmetric state it would promptly dissociate. 

Confining ourselves to the electronic ground state (/ = 0), we see that near the 

minimum E° of Ef\X) at X = X 0 ~ is/h/mw we may write approximately 

2 

U m \X) « Ef\X) + V n (X) » E° + C(X - Xq) 2 + V n (X 0 ) 

where C is a number of the order of unity. It follows that th e nuc lei perform harmonic 
oscillations with a vibration frequency of the order of Vm/M (o. The expectation, 
underlying the Born-Oppenheimer approximation, of relatively slow nuclear motion, 
is thus confirmed. Since actual molecules move in three dimensions and can rotate, 
besides vibrating along the line joining the nuclei, their spectra exhibit more com- 
plex features, but the general nature of the approximations used is the same. 

The electronic Schrodinger equation (8.115) with a potential that is invariant 
under the reflection x — > — x models the force by which two similar atoms are bound 
together in a diatomic molecule. As we saw in Section 8.5, the bond in the symmetric 
eigenstates has its origin in the fact that an electron can be exchanged between the 
atoms and is shared by them. An understanding of the more general covalent chem- 
ical binding between dissimilar atoms also relies on the concept of exchange, but 
the mechanism depends on pairs of electrons with spin, rather than on single elec- 
trons as considered here. 
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Exercise 8.23. For a semiquantitative estimate of the properties of the one- 
dimensional model of a homonuclear diatomic molecule with one shared electron, 
assume that the atoms have mass M — 10 u and that the electronic energy scale of 
the separate oscillator “atoms” is fuo = 10 eV. From Figure 8.5, deduce the equi- 
librium distance between the atoms and the corresponding dissociation energy. Es- 
timate the vibration frequency of the molecule. 


7. The Periodic Potential. In a solid, as in a molecule, we deal with slowly 
moving heavy atomic cores and swift valence and conduction electrons, justifying 
the use of the adiabatic approximation. In this section, we apply perturbation and 
matrix techniques to the Schrodinger equation for a particle in the presence of a 
one-dimensional periodic potential composed of a succession of potential wells. As 
a useful idealization of the potential to which an electron in a crystal lattice is 
exposed, we assume that the lattice of potential wells extends indefinitely in both 
directions, although in reality the number of atoms in a crystal is large but finite. 

Our experience with the Schrodinger equation for potentials that are even func- 
tions of x (harmonic oscillator, square well, etc.) has taught us that in order to derive 
and understand the energy eigenvalues and eigenfunctions, it is helpful first to con- 
sider the symmetry properties of the Hamiltonian. Group theory is the mathematical 
discipline that provides the tools for a systematic approach to symmetry in quantum 
mechanics. An introduction to the use of group representations in quantum mechan- 
ics will be given in Chapter 17, but the one-dimensional example of motions in a 
periodic potential can motivate and illustrate the group theoretic treatment. The 
relevant group for the dynamics of a particle in a periodic potential is the group of 
finite displacements or translations, which was introduced in Section 4.5. 

If the potential F(x) is periodic, such that 

D € V(x) = V(x - f) = V(x) 

the Hamiltonian possesses symmetry under finite translation by the displacement 
x — > x - £, for all x. Since the kinetic energy, being a derivative operator, is invariant 
under arbitrary translations, the symmetry of the Schrodinger equation and the Ham- 
iltonian is expressed by 

/ h 2 d 2 \ 

D^Hifj E (x) = ^ + V(x) U £ (x) = HD pjr E (x) = ED ( tp E {x) (8.118) 

From this equation, two conclusions can be drawn: The translation operator D s com- 
mutes with the Hamiltonian, 


D^H - HD S = [D s , H] = 0 

and if tp(x) is an eigenfunction of H, with eigenvalue E, then D^iJj e (x) is also an 
eigenfunction of H, with the same eigenvalue E. 

As a second-order linear homogeneous differential equation, the Schrodinger 
equation for a periodic potential generally admits two linearly independent eigen- 
functions, i/q(x) and i/r 2 (x), corresponding to an energy eigenvalue E. We may assume 
these to be orthogonal and suitably normalized. As eigenfunctions of H, with eigen- 
value E, the displaced functions Dgip^x) and D^ip 2 (x) are linear combinations of i/q(x) 
and t By the standard methods of linear algebra, we may construct those linear 
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combinations of if/fx) and which are eigenfunctions of the translation operator 
D ? . From Section 4.5 we know that the eigenfunctions of D 6 are the Bloch functions 


<M*) = e ikx u k {x) 


(8.119) 


where u k {x) is a periodic function: 

u k (x + g) = u k(x) (8.120) 

The corresponding eigenvalues have modulus one and can be expressed as e ,k ^. (In 
Section 4.5, this eigenvalue was written as e~ lk '^, and k was reserved for an operator. 
In this section, we have no need to refer to k as an operator, and we therefore omit 
the prime on the real number k.) From the preceding discussion we conclude that 
all eigenfunctions of H may be assumed to be Bloch functions. This property of the 
solutions of the Schrodinger equation with a periodic potential is known as Floquet’s 
theorem. 

Since the Schrodinger equation is real (invariant under time reversal), if if/ k (x ) 
is an eigenfunction, its complex conjugate, 

[<M*)]* = e- ikx u*(x ) (8.121) 


is also an eigenfunction of both H and corresponding to the eigenvalues E and 
e ik€ , respectively. The two eigenfunctions, which physically correspond to modu- 
lated plane waves propagating in opposite directions, are certain to be linearly in- 
dependent, except when e ik% = e~ ikS or kg = irn. In the language of group theory, 
the quantum number k (mod 27 rig) labels the irreducible representations of the one- 
dimensional group of finite translations. 

If the function (8.119) is substituted in the one-dimensional Schrodinger equa- 
tion, the periodic function, which we now more specifically designate by u kiE (x), 
must satisfy the differential equation 


h 2 d 2 u KE (x) 
2m dx 2 


ih 2 k du LE (x) h 2 k 2 , s 

— — " + — — u KE (x) + V(x)u KE (x) = E(k)u kiE (x) 

m dx 2m 


or, more compactly and intelligibly, 


H k u KE (x) = 


2m 



2 

u kM (x) + V(x)u KE {x) = E(k)u kE (x) 


( 8 . 122 ) 


In this equation the ^-dependence of the eigenfunctions and eigenvalues has been 
made explicit. Not surprisingly in view of the connection (8.1 19) between i jj k (x) and 
u ktE (x), these two functions are related as in a simple gauge transformation, discussed 
in Section 4.7. When the periodicity condition (8.120) is imposed on the solutions 
of (8.122), a dispersion function E = E(k) between the energy eigenvalues E and 
the quantum number k is obtained. As k is varied continuously over the real axis, 
the energy E ranges over continuous “allowed” bands. The extension of these ideas 
to three-dimensional periodic lattices forms the band theory of solids. 


Exercise 8.24. Derive (8.122) from the Schrodinger equation for i// k (x). 
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Exercise 8.25. In Eq. (8.122) make the replacement 

, 2tt 

k — > k H — — n 
£ 

(changing k by a reciprocal lattice vector), and show that the new equation has 
solutions periodic in x, corresponding to the old energy E(k). Also show that time- 
reversal invariance implies the degeneracy 

E(-k) = E(k ) (8.123) 


Thus, E(k) is a periodic (but also generally multivalued) function of k with 
period 2ir/i;, which is expressed by saying that “E(k) has the same symmetry as the 
reciprocal lattice.” Each interval defined by (2 n — l)7r/£ < k < (2 n + 1 )tt/£ is 
known as a Brillouin zone. Summarizing the results of Exercise 8.25, we conclude 
that 



Thus, the energy E{k) is symmetric (“even”) with respect to k = «7t/£ A plot of 
the continuous function E{k) based on these properties is known as the repeated- 
zone scheme. (An example will be shown in Figure 8.13.) 

Before solving the Schrodinger equation explicitly for an example, we dem- 
onstrate the appearance of allowed (and forbidden) energy bands by applying per- 
turbation theory to two extreme simplified models of an electron in a one-dimen- 
sional lattice. In the first model, we approximate the degenerate unperturbed state 
of a single electron by assigning to every lattice point x — ng an energy value E (0) , 
and an eigenfunction i ff^ix) = ip^ix — ng) that is narrowly concentrated at the n-th 
lattice site. Neglecting any overlap between these sharply localized wave functions, 
we assume them to be mutually orthogonal and normalized. We may imagine that 
in this extreme (“tight-binding”) approximation the electron is confined to a lattice 
site by an infinite potential barrier. We then relax this condition by introducing a 
perturbative interaction V that connects the lattice sites and allows the electron to 
be shared among them (such as, by tunneling through the potential barrier). In an 
ad hoc fashion, we postulate that the matrix elements of V are to be nonzero only 
for the interaction between nearest neighbors: 

{n—l\V\n) — v for all n (8.125) 

From Sections 8.2 and 8.3 we know that appropriate trial eigenfunctions are of the 
form 

¥°\x) = 2 c n ^\x) (8.126) 

n 


and that the amplitudes c„ are to be determined from the infinite set of simultaneous 
linear homogeneous equations, 

+ (£ (0) — E)c n + vc n+i — 0 for all n (8.127) 

These equations are easily solved by the substitution 


(8.128) 
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leading to the condition: 

E(k) = E m + 2v cos k£ 


(8.129) 


This simple model shows how the interaction causes the infinitely degenerate un- 
perturbed energy levels to be spread out into a continuous band of allowed energies, 
labeled by the quantum number k and ranging between E (0) + 2v > E(k ) > E (0> — 2v. 
The energy eigenfunctions, 

<A (0) W = 2 e ink ^ ( n\x) = e ikx 2 e-^-^YoXx ~ ng) = e ikx u k (x) 

n n 

expressible in the form (8.119), are Bloch functions. If there are other tightly bound 
unperturbed energy levels, there will be a separate band for each of them. Allowed 
bands may overlap, but if the unperturbed levels are sufficiently far apart and the 
interactions sufficiently weak, forbidden energy gaps occur between them. 

As a second illustrative model — at the opposite extreme from the tight-binding 
approximation — we consider a free electron that is perturbed by a weak periodic 
potential V(x), with period £. The unperturbed energy eigenstates are represented by 
plane waves, e ,kx , and the unperturbed energy is E (0 Xk) = h 2 k 2 /2m. This model is 
appropriate for sufficiently high energy levels. The critical point here is that the 
periodic potential has nonvanishing matrix elements only between plane wave states 
for which A k — 2n rn/g. (In three-dimensional terms, the k vectors of the two plane 
wave states must differ by a reciprocal lattice vector.) 


Exercise 8.26. Prove that for a periodic potential V(x), 

/•■+■ oo 

e~ ik2X V(x)e ik ' x dx = 0 (8.130) 

unless k x — k 2 — 2rrn /£. Generalize this result to three dimensions (see Exercise 
4.14). 


The diagonal matrix elements (k, = k 2 ) of the perturbing potential are respon- 
sible for a uniform first-order shift of all unperturbed energies, which can be ignored 
if we compensate for it by redefining the zero energy level. The simple perturbation 
theory of Section 8.3 shows that significant corrections to the free particle spectrum 
can be expected when two or more unperturbed energy eigenstates that are connected 
by a reciprocal lattice vector are degenerate, or nearly so. In one dimension, this 
occurs whenever k x — —k 2 = Nir/t; ( N : integer). According to formula (8.40), at 
and near the point of degeneracy the energy levels split and produce a forbidden 
gap of magnitude 


AE = 2 



(8.131) 


The integral in (8.131) is simply the TV-th Fourier component of the periodic 
potential. 

Finally, to avoid the extreme assumptions of the previous two simple models, 
we proceed to an exact calculation of the band structure for the periodic, battlement- 
shaped potential of Figure 8.9. This is known as the Kronig-Penney potential. Its 
period is £ = 2 a + 2b, where 2b is the width of the square wells and 2 a is the length 
of the segments between them. The matrix method introduced in Chapter 6 is es- 
pecially well suited for treating this problem. The solution of the Schrodinger equa- 
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v 



Figure 8.9. Periodic potential with rectangular sections (Kronig-Penney). The period has 
length £ = 2a + 2b, and the well depth is V 0 - 


tion inside the square wells, where V — — V 0 and hk' — V2 m(E + Vo), may be 
written in the form 

<K*) = A n e ik ' {x ~ ne) + B n e~ ik ’ (x - ni) (8.132) 

for a + (n — 1)£ < x < ng — a. Here n can be any integer, positive, negative, or 
zero. The coefficients belonging to successive values of n can be related by a matrix 
using the procedure and notation of Sections 6. 2-6.4. Noting that the centers of the 
plateaus between the potential wells have the coordinates x — ni;, we obtain 


«i + ipi 
2 


A n+l e~‘ k * 
B n+1 e ik ’* 


This may also be written as 


where the transfer matrix P is defined by 

Ao, - 

V 

subject to the condition 


-i&e*'* 

0 a i + iPi)e~ ik '* 


By iteration we have 


det P = aj + p\ - pi = 1 


>n( A o 

\B 0 


(8.133) 


(8.134) 


(8.135) 


(8.136) 


Applying these considerations to an infinite periodic lattice, we must demand that 
as n ±°° the matrix P n should generate an acceptable finite wave function by the 
rule (8.136). This requirement is most conveniently discussed in terms of the eigen- 
value problem of matrix P. Equation (8.133) shows that the transfer matrix repre- 
sents the translation operator Z)_ € . Hence, the eigenvalues p of P may be expressed, 
with real-valued k, as 

p ± = e ±ik * (8.137) 

The eigenvalues of P are roots of the characteristic equation 

det(P — pi) = p 2 — p trace P + det P — 0 
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or 

p 2 — 2 (a x cos k' g + Pi sin k'g)p +1=0 

The roots are 


1 


p± = - [trace P ± V(trace P) 2 — 4] 


(8.138) 


An acceptable solution is obtained, and a particular energy value is allowed only if 
P+ + P ~ = | |trace P\ = \a t cos k’g + fix sin k'g\ < 1 (8.139) 

or, using (8.137), 

cos kg = ai cos k'g + fix sin k'g (8.140) 

This is the desired dispersion relationship between k and E. 

If the roots (8.138) are unequal, or k g A Nir, two linearly independent solutions 

of the Schrodinger equation are obtained by identifying the initial values ( ° ) with 

\ B o/ 

the eigenvectors in: 




'A?>' 


K$ry P± W> 


(8.141) 


For the potential shape of Figure 8.9, oq and fix can be read off Eq. (6.12), and we 
obtain for E < 0 the dispersion relationship, 


cos kg = cosh 2 Ka cos 2k' b + 


k 2 - k 

2Kk' 


r 2 


sinh 2 Ka sin 2k' b (8.142) 


where fiK — \/—2mE. 

Since |cosh 2ko\ s 1, it is readily seen from (8.142) that all energy values for 
which 


2 k'b — Ntt ( N = integer) (8.143) 

are forbidden or are at edges of allowed bands. From the continuity of all the func- 
tions involved it follows that there must generally be forbidden ranges of energy 
values in the neighborhood of the discrete values determined by (8.143). 


Exercise 8.27 . Show that if E > 0, the eigenvalue condition for the periodic 
potential of Figure 8.9 becomes 


£'2 _|_ £"2 

cos kg — cos 2k" a cos 2 k'b " — sin 2k" a sin 2 k'b (8.144) 


where ftk" — V2mE. Verify that the energies determined by the condition 2k" a + 
2 k'b = Ntt are forbidden. 


In Figure 8.10 the right-hand side of Eqs. (8.142) and (8.144) is plotted as a 
function of k' for a particular choice of the constants of the potential. From this plot 
it is then possible to relate the values of the energy E(k) to the parameter k. The 
condition |cos kg \ < 1 separates the allowed bands from the forbidden bands un- 
ambiguously in this one-dimensional potential model. Since, by (8.137), k is defined 
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cos k( 



Figure 8.10. Plot of the right-hand- side of Eqs. (8.142) and (8.144) as a function of k' for 
a Kronig-Penney pot ential (Figure 8.9) with the choice of 4a = 4b — £ for the linear 
dimensions and V2mb 2 V 0 = Trh/2 for the well depth. Since the plotted function equals 
cos k%, only the segments of the curve between — 1 and + 1 are allowed. The enlarged 
detail plot shows the allowed bands (heavy black segments on the abscissa) and the 
forbidden energy gaps between them and illustrates the transition from tight binding to the 
nearly-free regime at high energies. The high peak of the curve at k' — 0 is suppressed. 
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only to within integral multiples of 2t r/£, it is possible to restrict its range so that 
— tt < kg < tt and regard the energy £ as a multivalued function of the reduced 
variable k. This representation of E versus k, shown in Figure 8.11, is called the 
reduced-zone scheme. It is also possible, however, to let k range from — °° to +°° 
and to divide the fc-space into zones of length 2i r/£. In this extended-zone scheme, 
the successive energy bands are assigned to neighboring zones. Figure 8.12 illus- 
trates the extended-zone scheme and shows that the energy spectrum consists of 
continuous bands separated by forbidden gaps. 5 It is natural to associate the lowest 
energy with the first (Brillouin) zone, — tt < k£ < tt, and to let continuity and 


2mb 2 {E+V 0 ) 

h 2 



Figure 8.11, The allowed energies E versus k, in the reduced Brillouin zone scheme, for 
the Kronig-Penney potential defined in Figures 8.9 and 8.10, with V 0 = T^tf/Smb 2 . The 
dashed line indicates the energy E = — l.60h 2 /2mb 2 of the single bound level for the 
isolated well of width 2b and depth V 0 (see Figure 6.6). The dot-dashed line at ir74 
corresponds to E = 0, which is the energy at the top of the barrier. 


5 Christman (1988), Ashcroft and Mermin (1976), and Dekker (1957). Insight can be gained from 
Weinreich (1965). 
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2 mb 2 (E+V 0 ) 
fi 2 



Figure 8.12. The extended-zone scheme for the same conditions as in Figure 8.11. 


monotonicity, as well as physical intuition, guide the proper identification of E as a 
function of k. In the limit E — » +°°, the perturbation method applies, k then ap- 
proaches k' and becomes the wave number of the nearly-free particle. The period- 
icity of the energy in the reciprocal lattice space is emphasized in the repeated-zone 
scheme , which is shown in Figure 8.13 for our Kronig-Penney model. The smooth 
behavior at the zone boundaries is a consequence of the symmetry relation (8.124). 


2 mb 2 (E+Vo) 

W~~ 



Figure 8.13. The repeated-zone scheme for the same conditions as in Figure 8.11. 
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Exercise 8.28. Using the results of Exercise 8.27 for the Kronig-Penney po- 
tential of Figure 8.9, prove in the limit of high energies that kg - 2 k"a + 2 k’b, and 
compare the numerical consequences of this relation with the exact dispersion curve, 
Figure 8.11. In this regime, compute the width of the forbidden gaps in the pertur- 
bation approximation as a function of N. 

Exercise 8.29. If in the periodic Kronig-Penney potential of Figure 8.9 the 
square wells become delta functions in the limit b -» 0 and V 0 -» °o such that V 0 b 
remains finite, show that the eigenvalue condition reduces to 

cos kg = cosh Kg — — sinh Kg 

K 


for E < 0, and 


cos kg = cos k"g — — sin k"g 

K 


for E > 0. Here Hk 0 


2m Y\m{V 0 b) 
h 


V— 2 mE 0 , with E 0 being the binding energy 


of the bound state in the delta-function well (see Section 6.4). Discuss the occurrence 
of allowed bands and forbidden energy gaps. Check the prediction for the gap width 
in the weak perturbation limit. Show that as g — > °°, the allowed band for E < 0 
degenerates into the discrete energy level E 0 . 


We must now look briefly at the eigenfunctions of the Schrodinger equation. 6 
Inside the wells they are of the form (8.132). The coefficients of the plane waves 
for two fundamental solutions corresponding to the same energy are given by 




and 




Because of invariance under time reversal, we may assume the relation 


= (wy 

B V U, + 7 

between the two solutions. From the eigenvalue equation for the matrix P, we obtain 
the ratio 


Aq +) = fee*'* 

oq sin k' g — fe cos k' g — sin kg 

which can be used to construct the eigenfunctions inside the square wells: 
i lf (+ \x) — [j/f (_) (x)]* 

= e ink H^ 2 e ik ' {x ~ {n ~ U2) ^ + (a, sin k'g - ft cos k'g - sin kg) e - lk ' lx ~ (n ~ V2) ®} 
= e lkx e ik(x ni) {- • •} = e ,kx u KE (x) (8.145) 

for a + (n — 1 )g < x < ng — a. In the plateaus where V = 0, the harmonic waves 
are, for E < 0, replaced by increasing and decreasing exponentials, the coefficients 


6 A more detailed discussion of quantum states in simple periodic lattices is found in Liboff (1992). 
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being determined by smoothly joining the eigenfunction at the discontinuities of the 
potential. The function u kjE (x) defined by Eq. (8.145) has the periodicity property 

u KE (x + 0 = u KE {x) 

and the eigenfunctions (8.145) are Bloch functions, as anticipated. 

Since the idealized periodic lattice is infinite and the emphasis has been on the 
energy spectrum, it has not been necessary to specify the boundary and normaliza- 
tion conditions for the Bloch-type energy eigenfunctions if/ E (x) = e lkx u kiE (x ) of the 
Schrodinger equation. As for a free particle, periodic boundary conditions may be 
imposed in a domain L = Ng, which is a large multiple of the lattice spacing. The 
permissible values of k are discrete for finite N but continuous in the limit N — » °° . 

Unlike perfect plane waves, Bloch functions are generally not momentum ei- 
genfunctions, but it is of physical interest to evaluate the expectation value of the 
velocity in Bloch states. Since 

^ ^ ^ 2 ”1 

Px4*k(x) = T — e ikx u k (x) = e ,kx hk + - — w*(x) 
id ) C L 1 dxj 

we have 

(p x ) = m ( v x ) = C J ip*(x)p x if/ k (x) dx = C j uf(x) hk + t ^ (8.146) 

where C is a normalization constant. With the definition (8.122) for the Hamiltonian 
operator H k , the last expression can be written as 

<i v x ) = J J “*(*) ^dk Uk ^ dX = \ (^) (8.147) 

With boundary conditions that ensure that H k is Hermitian, we finally deduce the 
simple result: 

(8.148) 

This relation shows that (v x ) is the group velocity of a wave packet that is a Bloch 
wave modulated by a broad real-valued amplitude function. 



Exercise 8.30. Prove the equality 


dE(k) _ I dH k \ 
dk \ dk / 


(8.149) 


which is a special case of the Hellmann-Feynman theorem (Exercise 18.14). Cal- 
culate (v x ) for the dispersion function E(k) in the tight-binding approximation, 
(8.129). 


Exercise 8.31. Following the derivation and approximation in Section 2.3 for 
free particles, show that in a periodic lattice a wave packet that initially has the form 
of an amplitude-modulated Bloch function, 

i/j(x, 0) = A(x)e ik ° x u ko (x) 

with a broad bell-shaped real-valued amplitude A(x), propagates with constant ve- 
locity (8.148) and without change of shape, except for a phase factor. 
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The two solutions t/t (+) (x) and defined in (8.145) are linearly indepen- 

dent and describe uniformly propagating waves, unless ki; = Ntt (N integer), in 
which case the two solutions become identical. Since it can be shown that for these 
special values of k, 

O'! sin k'g — 0! cos k' % — ±f } 2 (8.150) 

the corresponding eigenfunctions (8.145) represent standing waves, interpretable as 
resulting from Bragg reflection. These solutions correspond to the band edges. In 
the nearly-free particle approximation, the condition for the band edges coincides 
with the equation locating the small forbidden gaps. 


Exercise 8.32. Verify (8.150) for ktj — Ntt. Show that at the band edges, the 
group velocity is 


(v k ) 


dE(k ) 
dk 


(8.151) 


In this chapter, as in the preceding ones, we have formulated quantum mechan- 
ics for systems that are best visualized in some kind of (mostly one-dimensional) 
coordinate space or the complementary momentum space. At almost every step, it 
has been evident that the theory can be greatly generalized if it is expressed in terms 
of the mathematical apparatus of complex vector spaces and their matrix represen- 
tations. We now turn to this general formulation of quantum mechanics. 


Problems 

1. Apply the variational method to estimate the ground state energy of a particle con- 
fined in a one-dimensional box for which V - 0 for —a < x < a, and fl(±a) = 0. 

(a) First, use an unnormalized trapezoidal trial function which vanishes at ±a 
and is symmetric with respect to the center of the well: 


<M*) = 


(a ~ |*|) 
a — b 


b < |*| e= a 
\x\ < b 


Try the choice b — 0 (triangular trial function) and then improve on this by opti- 
mizing the parameter b. 


(b) A more sophisticated trial function is parabolic, again vanishing at the end- 
points and even in x. 


(c) Use a quartic trial function of the form 


if/ t (x) — (a 2 — x 2 )(ax 2 + jS) 

where the ratio of the adjustable parameters a and (3 is determined variationally. 

(d) Compare the results of the different variational calculations with the exact 
ground state energy, and, using normalized wave functions, evaluate the mean- 

square deviation f | ip(x) — ip,(x) \ 2 dx for the various cases. 


(e) Show that the variational procedure produces, in addition to the approxi- 
mation to the ground state, an optimal quartic trial function with nodes between the 
endpoints. Interpret the corresponding stationary energy value. 7 


’Adapted from Cohen-Tamoudji, Diu, and Laloe (1977), Chapter 11. 
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2. Using scaled variables, as in Section 5.1, consider the anharmonic oscillator Ham- 
iltonian, 

H = jpl + J + Af 
where A is a real-valued parameter. 

(a) Estimate the ground state energy by a variational calculation, using as a 
trial function the ground state wave function for the harmonic oscillator 

h 0 (o}) = \pI + \ 6,2 ? 

where a> is an adjustable variational parameter. Derive an equation that relates <u 
and A. 

(b) Compute the variational estimate of the ground state energy of H for various 
positive values of the strength A. 

(c) Note that the method yields answers for a discrete energy eigenstate even 
if A is slightly negative. Draw the potential energy curve to judge if this result makes 
physical sense. Explain. 

3. In first-order perturbation theory, calculate the change in the energy levels of a linear 
harmonic oscillator that is perturbed by a potential gx 4 . For small values of the 
coefficient, compare the result with the variational calculation in Problem 2. 

4. Using a Gaussian trial function, e~ Xxl , with an adjustable parameter, make a varia- 
tional estimate of the ground state energy for a particle in a Gaussian potential well, 
represented by the Hamiltonian 

p 2 2 

H = - V 0 e ~ ax (Vo and a > 0) 

2m 

5. Show that as inadequate a variational trial function as 


f ( X \ 


c h 

for |jc| :£ a 

V a) 


l 0 

for |jc| > a 


yields, for the optimum value of a, an upper limit to the ground state energy of the 
linear harmonic oscillator, which lies within less than 10 percent of the exact value. 

6. A particle of mass m moves in a potential V(r). The n-th discrete energy eigenfunc- 
tion of this system, i/'„(r), corresponds to the energy eigenvalue E n . Apply the vari- 
ational principle by using as a trial function, 

<M r) = <An(Ar) 

where A is a variational (scaling) parameter, and derive the virial theorem for sta- 
tionary states. 

7. In Chapter 6 it was shown that every one-dimensional square well supports at least 
one bound state. By use of the variational principle, prove that the same is true for 
any one-dimensional potential that is negative for all values of x and that behaves 
as V — » 0 as x — » ±°°. 

8. Work out an approximation to the energy splitting between the second and third 
excited levels of the double oscillator defined in Section 8.5, assuming the distance 
between the wells to be very large compared with the classical amplitude of the 
zero-point vibrations. 
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9. Solve the energy eigenvalue problem for a particle that is confined in a two-dimen- 
sional square box whose sides have length L and are oriented along the x- and 
y-coordinate axes with one corner at the origin. Find the eigenvalues and eigen- 
functions, and calculate the number of eigenstates per unit energy interval for high 
energies. 

A small perturbation V — Cxy is now introduced. Find the approximate energy 
change of the ground state and the splitting of the first excited energy level. For the 
given perturbation, construct the optimal superpositions of the unperturbed wave 
functions in the case of the first excited state. 

10 . As an example of Problem 2 in Chapter 7, apply the WKB approximation to the 
double harmonic oscillator of Section 8.5, and contrast the energy level splitting of 
the two lowest levels with the results obtained in Section 8.5. 

11. The energy E 0 (a ) of the lowest eigenstate of a double harmonic oscillator with fixed 
a depends on the distance a and has a minimum at a — a 0 (see Figure 8.5). Adapt 
the Hellmann-Feynman theorem for the expectation value of a parameter-dependent 
Hamiltonian (Exercise 8.30) to this problem, and show that if a — a 0 , the expectation 
value of |x| is equal to a 0 . 

12 . Apply the WKB approximation to a periodic potential in one dimension, and derive 
an implicit equation for the dispersion function E(k). Estimate the width of the 
valence band of allowed bound energy levels. 

13 . Assume that n unperturbed, but not necessarily degenerate, eigenstates | k) of an 
unperturbed Hamiltonian H 0 (with k — 1, 2, ...,«) all interact with one of them, 
say 1 1), but not otherwise so that the perturbation matrix elements (k\V\k') A 0 
only if either k = k' or k — 1 or F = 1. Solve the eigenvalue problem in the 
n-dimensional vector space exactly and derive an implicit equation for the perturbed 
energies. Using a graphic method, discuss the solutions of the eigenvalue problem 
for various assumed values of the nonvanishing matrix elements of V, and exhibit 
the nature of the perturbed eigenstates. 
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Generalizing the concepts of wave mechanics, in this chapter we begin 
to formulate the principles of quantum mechanics in terms of the 
mathematical structure of an abstract vector space (separable Hilbert 
space). Here we review and assemble in one place the tools of the state 
vector and operator formalism of quantum mechanics, including the 
commonly used notational devices. At the end, we close the circle by 
showing how ordinary wave mechanics reemerges from the general theory 
as a special representation. 

1. Probability Amplitudes and Their Composition. The principles of wave 
mechanics were the subject of Chapter 4, and in Chapters 5 through 8 the time- 
independent Schrodinger equation was applied to specific simple dynamical systems, 
mostly in one dimension. These illustrations have provided us with several examples 
of Hamiltonians, or energy operators, whose admissible eigenfunctions constitute 
complete orthonormal sets for the expansion of an arbitrary wave function (or state) 
t jf{x) of the one-dimensional position coordinate, x. Following Section 4.2, we sum- 
marize the standard procedure: The eigenvalue equation for the Hermitian operator 
A is 

Atff n (x) = A' n ifj n (x) (9.1) 

where A' n denotes the nth eigenvalue. The eigenfunctions satisfy the orthonormality 
condition 


f dx = 8 nk (9.2) 

An arbitrary state tp(x) is assumed to be expressible as the series, 

'!'(*) = 'Z c n ifj n (x) (9.3) 

n 

with the expansion coefficients given by 

/*+ oo 

c n = I r n (x)m dx (9.4) 

The coefficients c n are the probability amplitudes. If the eigenvalue A' is simple, 
measurements of A give the result A' n with probability | c n | 2 . If ij/(x) happens to be 
equal to an eigenfunction the measurement of A will yield A' k with certainty 

and confirm that tj/(x)=ij/ k (x). Generally, if/(x) is a superposition of many eigenstates. 

If an eigenvalue of A is not simple but corresponds to more than one linearly 
independent eigenfunction, additional labels are required to characterize the eigen- 
functions, and measurements of A alone do not unambiguously identify all the in- 
dividual terms in the expansion. In this very common case, the determination of 
which ij/ n (x) represents the system can be further narrowed down if the eigenstates 
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of A are also eigenstates of a second Hermitian operator B, with eigenvalues 
B' n , such that eigenstates which correspond to the same eigenvalue of A are distin- 
guished by different eigenvalues of B, to which they correspond. Operators A and B 
are said to be compatible. If after the introduction of B the simultaneous measure- 
ments of A and B still do not unambiguously specify all the eigenstates, the proce- 
dure is continued until a complete set of compatible Hermitian operators is found. 
Since they share a complete set of common eigenfunctions, the physical observables 
that these operators represent are simultaneously assignable to the system, or si- 
multaneously measurable. As we will show in the next chapter, the necessary and 
sufficient mathematical condition for two observables A and B to be compatible, or 
to possess a common complete set of eigenstates, is that the operators A and B 
commute. For the present, we symbolize a complete set of operators by a single 
operator K, with simple eigenvalues K ( . In order to simplify the notation, we omit 
the prime on the discrete eigenvalues of a generic operator like K and rely on the 
subscript alone to identify AT, as denoting a set of eigenvalues. 

With the experience of ordinary Schrodinger wave mechanics as our guide, we 
now endeavor to build up a general formalism of quantum mechanics for any dy- 
namical system. The fundamental assumption of quantum mechanics is as follows: 


The maximum information about the outcome of physical measurements on a 
system is contained in the probability amplitudes that belong to a complete 
set of observables for the system. 

If the state is denoted by M*, the probability amplitude for finding AT, in a measure- 
ment of AT is a complex number, usually denoted by {AT,|M / ). The probability 
of finding K t when K is measured is given by |(Ar / |'\l r )| 2 . Applied to the wave- 
mechanical formalism encapsulated in Eqs. (9.1)-(9.4), the probability amplitude c n 
can be expressed as c n = (A^|T r ). 

We stress again that the quantity K may stand for several compatible observa- 
bles; the nature of the physical system determines how many variables K and its 
eigenvalues AT, represent and what they are. For example, a structureless particle 
moving in three dimensions is conveniently described by the three commuting ob- 
servables, x, y, z, measuring the particle position. The corresponding probability 
amplitude (x, y, z^) is nothing other than the coordinate state function or wave 
function if/(x, y, z ) of the particle. An alternative set of commuting observables is 
p x , P y > P Z i an d the probability amplitude (p x , p y , p z |M > ‘) is the momentum wave func- 
tion (f)(p x , p y , p z ). A very different complete set of commuting observables for the 
same system consists of the three linear harmonic oscillator Hamiltonians, 


with eigenvalues 



- /n.fojfx? 
2 1 J J 


U = 1, 2, 3) 


Ej 


= ho}\ n i 



where the quantum numbers are nonnegative integers. (We use the notations x, y, 
z and jci, x 2 , x 3 interchangeably as dictated by convenience.) The probability ampli- 
tudes, which completely specify the state M/', are (n 1 n 2 « 3 |M(). Thus, the state may 
be represented equivalently by an indenumerable continuum of amplitudes, 
{x, y, z|^> = i/j(x, y, z), or (p x , p y , p z \V) = <j)(p x , p y , p z ), or by a denumerably 
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infinite set of discrete amplitudes like ( n x n 2 « 3 1 In general, an amplitude may 
depend on both discrete and continuous variables, and some useful physical quan- 
tities, such as the energy of an atom, have eigenvalue spectra that are part discrete 
and part continuous. 

It is convenient to be able to encompass both kinds of spectra, discrete and 
continuous, in a unified notation, and there are several ways of doing this. The 
Stieltjes integral is a mathematical tool and a generalization of the Riemann integral, 
allowing discrete “sums to be regarded as special cases of integrals. 1 Delta functions 
may be employed as densities in integrations to represent discrete sums as integrals. 
Conversely, one can think of the continuous spectrum of an observable as the lim- 
iting case of a completely discrete spectrum of a slightly different observable. The 
difference between the two usually arises from a modification of the boundary con- 
ditions or from the neglect of a small “interaction” term in the operator. 

In this chapter, most equations are written in a manner appropriate for discrete 
eigenvalues of the observables, but it is always easy to replace sums by integrals 
and Kronecker deltas by delta functions, when the spectrum is continuous (see Sec- 
tion 9.6). 

To keep the notation uncluttered, as long as no confusion is likely to occur, 
operators will be denoted simply as A, B, K, etc., just like the physical quantities 
they represent; their measured numerical values or eigenvalues will be denoted as 
A'j, Bj, etc. ( but Ki instead of K'i), whenever the need to distinguish them from the 
operators calls for caution. 

In the remainder of this section, we will infer the mathematical properties of 
the amplitudes from the probability interpretation and the laws of quantum physics. 
In every instance, these mathematical properties are satisfied by the probability am- 
plitudes (9.4) of one-dimensional quantum mechanics (wave mechanics). 

First, since the values K t constitute all the possible results of measuring the 
observable K, the probabilities must add up to unity: 


2 = 1 


(9.5) 


expressing the normalization of the probability amplitudes. It is customary to denote 
by (A-|5j) the probability amplitude for finding A\ upon measuring A on a system 
in a state for which a measurement of B yields the value B ) with certainty. In par- 
ticular, since the different measured values of K are supposed to be unambiguously 
distinguishable and mutually exclusive, we must require 

(Ki\Kj) = 0 if i * j (9.6) 

The amplitude (Ki\ K t ) must have modulus unity, and it is therefore possible to make 
the choice 


(Ki\Kj) = Sij (9.7) 

An arbitrary state ^P can be equivalently specified by the probability amplitudes 
for any complete set of observables, K, L, and so on. The formula connecting the 
amplitudes (K t | 'P) and (Lj | 'P) is patterned after the expansion postulate, by which 
states describing different outcomes of a measurement are superimposed, and rep- 
resents the distillation of the accumulated empirical evidence underlying quantum 


'Ballentine (1990), p. 12. 
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mechanics. The link between the probability amplitudes for two representations is 
established by the composition rule for probability amplitudes, 

(9.8) 


Exercise 9.1. Show the consistency of (9.8) with condition (9.7). 

The simple relation (9.8) embraces many of the surprising and peculiar features 
that set quantum mechanics apart from a classical description. If L is measured on 
a system in state (P, the probability of finding Lj is 

|(L;|^)| 2 = 2 (LjlK.XK^) 2 (9.9) 

I 

which differs from the conventional rule for calculating probabilities: 

2 \(L,\K t )\ 2 |<*i|*>| 2 (9.10) 

l* 

by the presence of interference terms. Expression (9.10) is not wrong, but it is the 
probability of finding Lj if an actual measurement of K is first carried out, leaving 
the system in one of the states in which a remeasurement of K would yield K t with 
certainty. Generally, such an interim measurement of K alters the initial state dra- 
matically and irreversibly, so that (9.10) does not represent the probability of di- 
rectly finding Lj for a system that is in state ' v f r . 

If the composition rule (9.8) is applied to state iMf) = | L„), we get 

2 <L; | *,•><*,- 1 L„> = (Lj\L n ) = S jn (9.11) 

i 

For fixed j, these equations permit the evaluation of the amplitudes (L y |iQ if the 
amplitudes {K t \L n ) are given. The solution can be inferred by comparing the set of 
linear equations (9.11) with the normalization condition 

2 \{Ki\Lj)\ 2 = X {Ki | Lj)*(Ki | Lj) = 1 
i i 

Putting n — j in (9.11), we are led to conclude that 

<Lj\K t ) = <g,-lZ^j (9.12) 

and, therefore, that there is an important reciprocal relationship: 

\(Kj\Lj)\ 2 = |(L,|fr)| 2 | (9.13) 

According to (9.13), the conditional probability of finding K t in a state that is known 
to “have” the sharp value L } for L is equal to the probability of finding Lj in a state 
that is known to have a sharp value K t for K. Equation (9.13) is in accord with all 
the experimental evidence. 

If conditions (9.11) and (9.12) are combined, the probability amplitudes {Lj\K t ) 
are seen to be subject to the conditions 

E (L j \K i )(L n \K i )* = 8 jn 



(9.14) 
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expressing the unitary character of the transformation matrix S, whose elements are 
defined as 


Sji = (Lj\K t ) (9.15) 

The unitary property of S implies that 

SS f = 1 (9.16) 

& 

Exercise 9.2. Show that Eqs. (9.5) through (9.16) are valid if they are trans- 
lated into the quantum mechanics of a particle with one position coordinate (wave 
mechanics in one dimension). Show that (9.14) expresses the orthonormality of 
eigenfunctions of observables as well as the closure property. 

The two-slit ifiterference experiment (Figure 1.1), interpreted in terms of a sta- 
tistical distribution of particles on the detecting screen, can be used to illustrate the 
rules for quantum mechanical amplitudes established in this section. In a typical 
interference setup, the two amplitudes (A | and ( B | for particles to appear in 
the separate slits or channels, A and B, completely determine the state M'. The same 
state can be described in terms of the amplitudes for the outcome of possible mea- 
surements in the region beyond the screen with the slits. If the position r is chosen 
as the observable for this description, the amplitude for finding the particle at r is 
related to the amplitudes (A|''P) and by the composition law: 

<r|¥> = <r|A)<A|¥> + <r|fl><fl|¥> (9.17) 

Interference phenomena arise from calculating the probability |(r|^)| 2 with this 
formula. 

Exercise 9.3. In a two-slit interference experiment with particles of definite 
wave number (energy) k, the slits A and B are located at positions r A and r s . At 
large distance from the slits the amplitudes (r|A> and (r|B) are to reasonable ac- 
curacy represented by (r|A) °c e ,fc l rA_r i and (r|B) °c e '*l r B _r l. Show how, to within 
a constant of proportionality, the probability of finding the particle at position r 
depends on the difference of the distances from A and B to the point of observation, 
and on the relative magnitude and phase of the amplitudes (A|M r ) and (5|T''), which 
are determined by the experimental arrangement. 

Two-slit interference is the paradigm for countless (real and thought) experi- 
ments in quantum physics. We extend the discussion to include the effect of an 
additional monitor that receives a signal alerting it to the passage of the particle 
through one of the slits. We characterize the point of observation of the particle 
(either at great distance r from the slits or on the focal plane of a converging lens) 
by the direction k of the diffracted wave (Figure 9.1). Under these conditions, we 
have an amplitude, (kAf|A), corresponding to the probability of finding the particle 
that is known to pass through A, subsequently to be diffracted in direction k and to 
cause the detector to fire. A second amplitude, (kM\A), corresponds to the comple- 
mentary outcome that the diffracted particle is not accompanied by a “hit” in the 
detector. There are analogous amplitudes (kM\B) and (kM\B) for passage of the 
particle through slit B. 



184 


Chapter 9 Vector Spaces in Quantum Mechanics 




Figure 9.1. Two-slit interference in the presence of a monitor that has the capacity to 
respond to the passage of a particle diffracted in direction k through slits A or B. In 
scenario ( a ) the monitor registers the passage with probability amplitudes (kA/| A) and 
( kM\B ), respectively. In scenario ( b ) the amplitudes (kM]A) and ( kM\B ) correspond to the 
monitor remaining in its latent initial state, while the particle is diffracted. The off-axis 
placement of the monitor is meant to suggest that it may be more sensitive to particles 
passing through one slit than the other. 

Given the state 'P, the amplitude for finding the particle to have been diffracted 
in direction k and the detector to have fired is, by the composition rule: 

(kM\V) = <kM|A)<A|^> + <kAf|fl><fl|¥> (9.18) 

Similarly, the amplitude for diffraction in direction k and no response from the 
detector is 

<kM|^> = <kM|A><A|¥> + <kM|B><5|^> (9.19) 

Generally, the probabilities |{kAf|^)j 2 and |{kM| , I r )| 2 , calculated from (9.18) and 
(9.19), exhibit interference. 
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If, by virtue of its placement near one of the slits or by some other technique, 
the monitor responds with high efficiency only to passage of the particle through 
one of the slits, say A, we have (kM|B) ~ (kM\A) ~ 0. We say that A is correlated 
with M, and B with M. The probability of observing the particle in direction k is 
then 

|<kM|^>| 2 + |<kM|^)| 2 ~ |{kM|A><A|^)| 2 + |<kM|B)(B|'I')| 2 (9.20) 

without an interference term. If, on the other hand, the detector monitors the passage 
of all particles through either of the two slits perfectly, but without the capacity to 
distinguish between the originating slit (perhaps because it is located halfway be- 
tween A and B), the probability of observing the particle in the direction k is 

|<kM|^)| 2 ~ |<kAf|A><A|¥> + (kM|B)<B|^)j 2 (9.21) 

This expression, like (9.17), leads to a typical two-slit interference pattern. In gen- 
eral, with a monitoring device M in place, the probability of observing a particle in 
direction k is 

|(kM|^)| 2 + |<kM|^)| 2 = |<kM|A)(A|^) + <kM|B)(B|^)| 2 

+ |<kM|A)(A|^) + <kM|B)(B|^)[ 2 1 ' 

We see that, for particles diffracted along k, the intensity exhibits interference ef- 
fects between the amplitudes pertaining to the two slits, unless we can unambigu- 
ously distinguish between the two distinct paths. We emphasize that for interference 
to be eradicated, the monitor M, which is correlated to the passage of the particle 
through the slits, does not need to produce recorded signals in the detector. It may 
indeed be idle, but in principle it must be able to distinguish path A from path B. 2 

In Section 9.4 we will see how the physically motivated rules for probability 
amplitudes can be fitted into a general mathematical framework. We will consider 
the probability amplitudes to be components of a vector ^ in a space whose “co- 
ordinate axes” are labeled by the different values AT,. We will refer to ’'P as a state 
vector, or briefly a state in the abstract vector space. 

To prepare for the vector space formulation of quantum mechanics, we review 
the fundamentals of the algebra of vector spaces and operators in the next two sec- 
tions. Two different but equivalent notations will be used in this chapter, because 
the Dirac bra-ket notation alone is not sufficiently flexible to meet all needs. 

In general, the states that have so far been considered require infinitely many 
amplitudes for their definition, and indeed many applications require the use of 
infinite-dimensional vector spaces. There are, however, important physical systems 
or degrees of freedom of systems, such as the spin of a particle, for which a finite 
number of amplitudes is appropriate. In order to keep the discussion simple, most 
of the concepts in this chapter will be developed for complex vector spaces of a 
finite number of dimensions, n. The infinite-dimensional vector spaces that are im- 
portant in quantum mechanics are analogous to finite-dimensional vector spaces and 
can be spanned by a countable basis. They are called separable Hilbert spaces. We 
will be interested only in those properties and theorems for n — > °° which are 
straightforward generalizations of the finite-dimensional theory. If, for these gen- 


2 This discussion is similar to, but more general than, that in Feynman (1965), vol. Ill, Chapter 
3. For descriptions of many fundamental experiments relevant to understanding quantum mechanics, 
see Ldvy-Leblond and Balibar (1990). 
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eralizations to hold, the vectors and operators of the space need to be subjected to 
certain additional restrictive conditions, we will suppose that these conditions are 
enforced. By confining ourselves to the complex vector space in n dimensions, we 
avoid questions that concern the convergence of sums over infinitely may terms, the 
interchangeability of several such summations, or the legitimacy of certain limiting 
processes. Nevertheless, the mathematical conclusions that we draw for the infinitely 
dimensional space by analogy with n-dimensional space can be rigorously justified. 3 

2. Vectors and Inner Products . Our abstract vector space is defined as a collec- 
tion of vectors, denoted by *P, any two of which, say and >P fe , can be combined 
to define a new vector, denoted as the sum *P a + ^P fc , with properties 

% + % = % + % (9.23) 

% + (% + %) = 0P a + %) + % (9.24) 

These rules define the addition of vectors. 

We also define the multiplication of a vector by an arbitrary complex number 
A. This is done by associating with any vector NP a vector A'P, subject to the rules 

(A + /z)\P = ANP + /t/'P, /z(A''P) = (pXyV (A, p: complex numbers) (9.25) 

A0P a + %) = A'P* + A'P*, and 1 • ¥ = ¥ (9.26) 

The vector space contains the null vector, 0, such that 

'P + 0 = 'P and 0-^ = 0 (9.27) 

for any vector ' V P. The null vector 0 will sometimes be written just plainly as 0, as 
if it were a number. This is not strictly correct, but should not be misleading. 

The k vectors ''Pj, ^P 2 , . . . , 1 P* are said to be linearly independent if no relation 


A^ + A 2 ^ 2 + • • • + \ k V k = 0 

exists between them, except the trivial one with Aj = A 2 = • • • = A* = 0. The 
vector space is said to be n-dimensional if there exist n linearly independent vectors, 
but if no ra + 1 vectors are linearly independent. 

In an n-dimensional space, we may choose a set of n linearly independent vec- 
tors 1 P 1 , SP 2 , . . . , \P„ . We refer to these vectors as the members of a basis, or as 
basis vectors. They are said to span the space, or to form a complete set of vectors, 
since an arbitrary vector ^P can be expanded in terms of them: 

n 

V = X a.% (9.28) 

i=i 

The coefficients a, are complex numbers. They are called the components of the 
vector ’'P, which they are said to represent. The components determine the vector 
completely. The components of the sum of two vectors are equal to the sum of the 
components: If \P a = ^ ) an d ‘'P* = 2 b t ;'P/, then 

V a + % = 2 («<• + bd% (9.29) 

i 


3 For a compendium of mathematical results and caveats, see Jordan (1969). More detail is found 
in Reed and Simon (1975-80) and Riesz and Nagy (1955). See also Fano (1971). 
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and similarly 

A % = X (Aa,)*, (9.30) 

I 

by the above rules for addition and multiplication. 

Next we introduce an inner (scalar) product between two vectors, denoted by 
the symbol (\P a , "'I'),). This is a complex number with the following properties: 

(¥*, ¥ a ) = OP., %)* (9.31) 

where the asterisk denotes complex conjugation. In the inner product (\P a , ^P*), 'P a 
is called the prefactor, and *P b is the postfactor. Their order is important. 

We further require that 

0P«, A'P*) = ACP a , ¥*) (9.32) 

From (9.31) and (9.32) it follows that 

(A¥ a , %) = A*0P., V b ) (9.33) 

We also postulate that 

OP*, + %) = (%, V b ) + OP., %) (9.34) 

and that 

OP, ^P) > 0 (9.35) 

w ith the equality sign holding if and only if ' V P is the null vector. The value 
VOP, '\P) = ||' V P|| is called the norm or the “length” of the vector 'P. A vector for 
which OP, ^P) = 1 is called a unit vector (or a normalized vector). 

Two vectors, neither of which is a null vector, are said to be orthogonal if their 
inner product vanishes. 

It is possible to construct sets of n vectors that satisfy the orthogonality and 
normalization conditions (often briefly referred to as the orthonormality property) 

(%,%) = 8 t j (i, j = l,...,n) (9.36) 

Since mutually orthogonal vectors are linearly independent, an orthonormal set can 
serve as a suitable basis. Generally, we assume that the basis vectors form an or- 
thonormal set, but occasionally, as in Section 8.4, a nonorthogonal basis is preferred. 

By taking the inner product with one of the orthonormal basis vectors, we obtain 
from the expansion (9.28) the formula 


°i = (%, n 


(9.37) 


for the components of \P. 

The scalar product of two arbitrary vectors is expressible as 

(*«, ft) = (2 «%, 2 w) = 22 Aft *,) = 2 b, <9.38) 

\ ‘ j / i j i 

In particular, for the square of the norm of a vector 


op, ¥) = ii'pfl 2 = X kl 2 


(9.39) 



188 


Chapter 9 Vector Spaces in Quantum Mechanics 


Exercise 9.4. If f a Q P) is a complex-valued (scalar) functional of the variable 
vector ''P with the linearity property 

fa(W b + pty e ) = AfM + fxf a (%) (9.40) 

where A and /jl are arbitrary complex numbers, show that f a can be represented as 
an inner product, 

/«(¥) = 0P a , ^) (9.41) 

for every 'P in the space and that (9.41) defines the vector <P a uniquely. This is the 
finite-dimensional analogue of Riesz ’s representation theorem, which assures us that 
a vector is fully and uniquely specified by its projections on all vectors in the space. 

3. Operators. We are now in a position to define operators in the vector space. 
An operator A is a prescription or mapping by which every vector 'P in the space is 
associated with a vector ’'P' in the space. Thus, 'P' is a function of ''P, and the 
notation 


= AOP) (9.42) 

is employed. The special class of operators that satisfy the conditions 

A0P a + W b ) = A(%) + A(V b ) (9.43) 

A(A^) = AAOP) (A: arbitrary complex number) (9.44) 

is most important to us. Such operators are called linear. For linear operators the 

parenthesis in (9.42) can be dropped, and we may simply write 

= A^ (9.45) 

thus stressing that the application of a linear operator is in many ways similar to 
ordinary multiplication of a vector by a number. 

Exercise 9.5. Prove that any linear operator A has the property AO = 0, if 0 
is the null vector. 

On occasion we will also deal with antilinear operators. These share the prop- 
erty (9.43) with linear operators, but (9.44) is replaced by 

A(A^) = A*A^ (9.46) 

Unless it is specifically stated that a particular operator is not linear, we will assume 
every operator to be linear and usually omit the adjective “linear.” 

Two operators, A and B, are equal if A'P = Bty for every 'P. Just as numbers 
can be added and multiplied, it is also sensible to define sums and products of 
operators by the relations: 

(A + B)V = AW + BV (9.47) 

(AS)'P = A(S^) (9.48) 

The last equation states that the operator AS acting on 'P produces the same vector 
that would be obtained if we first let S act on 'P and then A on the result of the 
previous operation. But whereas with numbers ab = ba, there is no need for oper- 
ators to yield the same result if they are applied in the reverse order. Hence, in 
general AS 4=- BA, although in particular cases two operators may, of course, com- 
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mute. For instance, A does commute with any function /(A) if f(x ) is an analytic 
function of the variable x. 

A trivial example of a linear operator is the identity operator, denoted by 1, 
with the property that 

'vjr — 

for every The,, operator A 1, where A is a number, merely multiplies each vector 

by the constant factor A. Hence, this operator may be simply written as A. 

A less trivial example of a linear operator is provided by the equation 

V = VjPa, (9.49) 

where is a given unit vector. This equation associates a vector M/ 1 ' with every ' v I r . 
The association is a linear one, and we write (9.49) as 

= P a V (9.50) 


defining the linear operator P a . 

Reasonably, P a is termed a projection operator, since all M*' are in the “direc- 
tion” of M*,, and the “length” of ’'P' equals the absolute value of the component 
01^, SP) of \P in that “direction.” 


Exercise 9.6. Prove that P a is a linear operator. 

Since 

P u/ — sir 

A a 1 a 1 a 

it follows that for any vector \P, 

Pi ¥ = P a (PaV) = PaVaWa, ¥) = VaWa, = P<& 

Thus, projection operators are idempotent, i.e., 

Pi = P a (9.51) 

In particular, for the projections on the basis vectors we have 

P& = qr.(qr., qr) = qr. a . ( 9 . 52 ) 


Hence, 

P t Pj9 = cijPjVj = 0 

Consequently, the projection operators for the basis vectors have the property 

P s Pj = PjPi = 0 if i * j (9.53) 

Note also that for every M r 


n n 

X W = X ai% = ^ 

/= 1 i= 1 

Hence, 


X P, = 1 

i= 1 


(9.54) 
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When a basis is given, an operator A can be characterized by its effect on the 
basis vectors. Indeed, being again a vector in the space, A^P) can obviously be ex- 
panded as 


A% = X %Ay (j = 1, 2 ,...,«) (9.55) 

i— 1 

where the A y are n 2 numbers which, owing to the linearity of A, completely specify 
the effect of A on any vector. To see this explicitly, we note 


*<■ = = * 2 <h% = 2 ajA% = S 2 a/M* = 2 %[2 A„a) 

J j j i i \ i / 

Hence, since ^ b/'P,, we have 


j 


(9.56) 


proving the contention that the effect of A on any vector is known if all A y are 
known. The order of the indices on the right-hand side of the defining equation (9.55) 
is deliberately chosen to produce the simple relation (9.56). 

Equation (9.56) can be written conveniently in matrix form as 


l b '\ 

b 2 


A 2 i 

A 12 

A 22 

' * 

■ ’ A 2n 

/ a i\ 

a 2 

\bj 


1 

A n 2 

A nn / 

w / 


(9.57) 


The possibility of using matrix notation here is not the result of any strange coin- 
cidence. Rather, the peculiar rule by which matrices qre multiplied is naturally 
adapted to the theory of linear transformations and any calculation in which linear 
quantities play a fundamental role. 

The inner product of two vectors can also be written in matrix notation. Ac- 
cording to (9.38), we have 


Wa, %) 


S a *b, = (a*at 


I M 

^2 

\bj 


(9.58) 


The choice of a particular basis determines the matrices in (9.57) and (9.58). 
The column matrices 


i:\ 


and 


/ M 

[ b 2 I 
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represent the vectors “9 a and M'*, and the square matrix 



represents the operator A. For this reason, we say that all these matrices constitute 
a representation of the abstract vector space. The matrix elements of A in a given 
representation with an orthonormal basis can be calculated by the formula 

Ay = (%, AT',)] (9.59) 

which follows immediately from (9.55) and (9.36). 

As an example, if is an arbitrary unit vector with components a t , the matrix 
elements of the projection operator P a are 

(Pah = (%, %(V a , %)) = Wa, %)(%, V a ) = <$0, (9.60) 

Exercise 9. 7. Show that for a linear operator 

(**, AV a ) = 22 bUijaj (9.61) 

i i 

and write this equation in matrix form. If A were antilinear, how would the corre- 
sponding equation look? 

Exercise 9.8. If F A (ty a , %) is a complex-valued (scalar) functional of the two 
variable vectors and M/), with the linearity properties 

F a ( w a + PL%, V b ) = A *F A (V e , V b ) + ijl*F a (V c , y b ) 

Fa(%„ W b + = AF A (^ a , V b ) + iJLF A (V a , %) 

show that F a can be represented as an inner product, 

F A (V a , = (¥„, AV b ) (9.62) 

for every M*, and and that (9.62) defines a linear operator A uniquely. (Compare 
with Exercise 9.4.) 

The matrix representing the sum of two operators is obtained by adding the 
matrices representing the two operators: 

(A + B)y = Ay + By (9.63) 

For the matrix of the product of two operators, we have 


(AB)y = (%, AB%) = (^, A 2 

= 2 W, A%)B v = 2 A ik B kj 


(9.64) 


This result shows that the matrix of an operator product equals the product of the 
matrices representing the operators, taken in the same order. 
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Exercise 9.9. Prove the multiplication rule (9.64) directly from (9.55) without 
assuming the basis to be orthonormal. 

Although there is a potential for confusion, we will denote the matrix repre- 
senting an operator A by the same symbol A. We thus emphasize the parallelism 
between linear operators and matrices and rely on the context to establish the in- 
tended meaning. However, the reader should avoid a complete identification of the 
two concepts, tempting as it may be, because the matrix A depends on the particular 
choice of basis vectors, whereas the operator A is a geometric entity, represented 
by a different matrix in every representation. We will return to this point when we 
consider the connection between different bases. 

It follows from Exercise 9.8 that corresponding to any given linear operator A 
we may define another linear operator, denoted by A f and called the ( Hermitian ) 
adjoint of A, which has the property that for any two vectors, and ^ b , 

M A f V b ) = (A%, V b ) (9.65) 

Specializing to unit vectors = NP), we see that 

(A% = (A)*- 


Thus the matrix representing the operator A + is obtained from the matrix representing 
the operator A by complex conjugation and transposition: 

A + - A* (9.66) 

where the symbol A is used for the transpose of matrix A, and A* is called the 
Hermitian conjugate of A. Note also that 

Wa, A%) = (A%, = 0P i5 A + ^J* - (A + ^ fl , V b ) 

From this and (9.65) we see that an operator can be moved at will from its position 
as multiplier of the postfactor in an inner product to a new position as multiplier of 
the prefactor, and vice versa, provided that the operator’s adjoint is taken. 

An important theorem concerns the adjoint of the product 

(AB) + = SU* (9.67) 

for operators or the matrices representing them. The proof is left to the reader. 

Exercise 9.10. If A is a linear operator, show that it is generally not possible 
to define a (“transpose”) linear operator A, which satisfies the equation 

(.%, AV a ) = (p a , AM (9.68) 

Show, however, that Eq. (9.68) defines an antilinear operator A t if A is itself anti- 
linear. 

A linear operator which is identical with its adjoint is said to be Hermitian (or 
self-adjoint). For a Hermitian operator, 

A f — A (9.69) 

Hermitian operators thus are generalizations of real numbers (which are identical 
with their complex conjugates). 
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If A is a Hermitian operator, the corresponding matrix A satisfies the condition 

A* = A (9.70) 

That is, the matrix elements that are located symmetrically with respect to the main 
diagonal are complex conjugates of each other. In particular, the diagonal matrix 
elements of a Hermitian operator are real. Matrices that satisfy condition (9.70) are 
also called Hermitian. 

a 

For a Hermitian operator 

Wa, AV a ) = (A'P*. 'Pa) = 0P fl , A9 a )* = real (9.71) 

The physical interpretation makes important use of the reality of this scalar product 
which is brought into correspondence with the expectation value of a physical quan- 
tity represented by the Hermitian operator A. 

An example of a Hermitian operator is afforded by the projection operator P a . 
Indeed, 


(%, P a %) = (%, %<p e , %)) = 0P a , 'PJOP*, V a ) 

= %) = C9aWa, ¥*), Vc) = (P a V b , %) 

A Hermitian operator A is said to be positive definite if for every vector 'P, 

OP, A\P) > 0 (9.72) 


Exercise 9.11. Prove that projection operators are positive definite. 


For positive definite operators we can derive a useful generalized Schwarz in- 
equality by substituting in (9.72) the vector 

'P = + A 'P* (9.73) 

where *P a and 'P b are arbitrary vectors, and A may be any complex number. We 
obtain 


OP, A¥) = 0P a , AV a ) + A0P a , A'P*) + A*0P*, A9 a ) 4- A*A0P & , A'P*) > 0 

The “best” inequality is obtained if A is chosen so as to minimize the left-hand 
side. By differentiation, the value of A that accomplishes this is found to be 

A = OP*. A'Pq) = 

OP*. A%) 0P b , A'P*) 


Substituting this value of A into the above inequality, we obtain the important result: 


QP a , A'PjQPfr, A^ b ) > |QP a , A'P,)) 2 


(9.74) 


For the choice A = 1, the Schwarz inequality 

0P o , q'JQP,, 'PJ > | QP a , ^ b )| 2 


(9.75) 


is derived. 

It follows from the Schwarz inequality that the inner product of two vectors 
('Pa, 'Pfc) is finite if the norms of 'Pa and 'P fc are finite. The Schwarz inequality may 
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be interpreted geometrically as stating that the cosine of the “angle” a between two 
vectors 

= 1 Wa, ^)j 

Vof*, %) II II** I! 

has modulus less than or equal to unity. 

We note that in (9.75) the equality sign holds if and only if OF, 'F) = 0, i.e. if 
+ A'F*, = 0, or if and only if ^F a and ' V F* are multiples of each other, which may 
be interpreted to imply that they are “parallel” vectors. 

A linear operator A, which is defined by 

'F' - ASP (9.45) 

may or may not have an inverse. An operator B which reverses the action of A, such 
that 

^F = BV (9.76) 

exists only if (9.45) associates different vectors NF' and with any two different 
vectors ^F a and NF & or in other words if the operator A preserves linear independence. 
Hence, for such an operator, as ^F runs through the entire n-dimensional vector 
space, ’F' does the same. We may substitute (9.45) in (9.76) or vice versa, and 
conclude that 

AB = 1 and BA = 1 (9.77) 

The operator B is unique, for if there were another operator B' with the property 
AB' = 1, we would have A(B' — B) = 0, or BA(B' — B) — 0; according to the 
second of the equations (9.77), this implies B’ — B = 0. It is therefore legitimate 
to speak of the inverse of A and use the notation B — A -1 . Evidently, 

(AM- 1 = A^Af 1 (9.78) 

If an operator A has an inverse, there can be no vector ' V F (cither than the null vector) 

such that 

A'F = 0 (9.79) 

Conversely, if there is no nontrivial vector which satisfies (9.79), then A has an 
inverse. 

Exercise 9.12. Show that a projection operator has no inverse (if the vector 
space has more than one dimension). 

The matrix that represents A -1 is the inverse of the matrix A. A necessary and 
sufficient condition for the existence of the inverse matrix is that det A 0. 

A linear operator whose inverse and adjoint are identical is called unitary. Such 
operators are generalizations of complex numbers of absolute value 1, that is, e ia . 
For a unitary operator U: 

U~ l = If or Ulf = ifU = 1 (9.80) 

Evidently, 

OF*, ¥*) = (W F fl , U%) 


(9.81) 
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Hence, a unitary operator preserves the norms or “lengths” of vectors and the “an- 
gles” between any two of them. In this sense U can be regarded as defining a 
“rotation” in the abstract vector space. In fact, the matrix representing U satisfies 
the condition of unitarity, 

UO* = 1 (9.82) 

If U is a real matrix, condition (9.82) becomes identical with the orthogonality 
relation in Euclidean space, emphasizing the formal analogy between unitary oper- 
ators in the complex vector space and rotations in ordinary space. 

Exercise 9.13. Prove that products of unitary operators are also unitary. 

Hermitian and unitary operators are special cases of the class of normal oper- 
ators. A linear operator A is said to be normal if it commutes with its adjoint: 

[A, A f ] = 0 (9.83) 

An arbitrary operator A can be written according to (4.6) as 

A = A + i A -~~ =H X + iH 2 (9.84) 

2 2 i 

where both H x and H 2 are Hermitian operators. The operator A is then seen to be 
normal if and only if 

[H u H 2 \ = 0 (9.85) 

In dealing with expressions containing operators and their commutators, it is helpful 
to have recourse to some algebraic rules and identities. Since these were already 
developed and compiled in Section 3.4, there is no need to rederive them here. They 
are as valid in abstract linear vector space as in the coordinate or momentum rep- 
resentation. 


4. The Vector Space of Quantum Mechanics and the Bra-Ket Notation. We can 
now proceed to express the physical principles of Section 9.1 in terms of the math- 
ematics Of vector spaces summarized in Sections 9.2 and 9.3. 

The physical state 'SP of a system is fully characterized by the probability am- 
plitudes (K, | M r ) relative to a complete set of observables. These probability ampli- 
tudes may now be taken to be the components of a state vector, which we shall also 
denote by 'P. The basis vectors 'NP, are the states in which the observable K may, in 
the interest of brevity, be said to have the sharp value K t . By this we mean only that 
the particular value is certain to be found if K is measured in state \P An equiv- 
alent notation is (.K'/I'P) = (' V P, | 1 P). The state *P is thus expressed as 

^ = E = E (9.86) 

i i 

from which we see that, on account of the orthonormality relation (9.36), 


(gtljQ = QP,-|¥) = QP t -, 'P) 


(9.87) 


demonstrating the identification of the probability amplitude (£',| , 'P) or ( 1 P,|''P) with 
the inner product ('P,, 'P). From now on the three expressions (9.87) will therefore 
be used interchangeably with no misgivings. By taking the inner product of (9.86) 
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with the state in which the observable L has the sharp value Lj, Eq. (9.86) is seen 
to be consistent with the composition rule (9.8) of probability amplitudes. This cir- 
cumstance, above all, testifies to the appropriateness of using vector spaces as the 
formal structure in which to conduct quantum mechanics. 

Instead of thinking of the inner product (9.87) as a product of two vectors T' 
and M*-, properly ordered, we can also think of the two factors as vectors in two 
different spaces. The postfactor M* is said to be a vector in ket-space and is denoted 
by The prefactor ^ is a vector in bra-space and is denoted by 0P,.|. Their 
product (M^H 4') is defined to be the inner product (M r ; |T r ). The distinction between 
the two vector spaces arises because the inner product is not commutative, since 
generally 

WaW = <¥*!*„>* * <%l v*) 

The same state can be expressed by either a ket vector |''P a ) or a bra vector OPj. 
Dirac has stylized this notation into |a) and (a\ for the two kinds of vectors. The 
inner product is written as 


QP„|%) = {a\b) = QP a , gfr) 


(9.88) 


This notation has led to the colorful designation of the (a | vector as a bra and the 
| a) vector as a ket. To every ket | a) there corresponds a bra {a\, and vice versa, 
subject to the conditions 

| a) + \b) (a\ + (b\ (9.89) 


and 

A | a) A*<a | (9.90) 

where the arrows indicate the correspondence between the two spaces. Taken by 
itself, each one of the two spaces is a vector space satisfying postulates (9.23) — 
(9.27). The two spaces are said to be dual to each other. I£ets are analogues of what 
in differential geometry are called vectors (or contravariant vectors), and we use the 
notation | M/} or 'P for them interchangeably. Bras | in this context are one-forms 
(or covariant vectors); and inner products are then also referred to as contractions. 
Equation (9.86) can be written in ket form as 

W = 2 |K,XK,I'P> (9.91) 

* 

The corresponding bra equation is 

<¥| = E (^|W,I = 2 (9.92) 

i i 

The components of a bra are complex conjugates of the components of the corre- 
sponding ket, since 

(b\a) = (a\b)* (9.93) 

The state |''J r ) is normalized to unity if 

OW = 2 £ = 2 = 1 (9.94) 

i j i 

consistent with the normalization condition (9.5). 




4 The Vector Space of Quantum Mechanics and the Bra-Ket Notation 


197 


The bra-ket notation also extends to operators. A linear operator A associates 
with every ket |a) another ket, 

| b) = A\a) 

such that 

A(|a,> + | a 2 » = M a \) + A\a 2 ) 
and 

AA|a> = A(A | a)) 

Some unnecessary vertical bars on the left of certain kets have been omitted for 
reasons of economy in notation. 

By letting the equation 

<c|{A|a}} = { <c j A } \a) = (c\A\a) (9.95) 

define a bra (c|A, we may allow a linear operator also to act from right to left, or 
“backwards.” The rules for multiplication to the left are 

«<h| + <c 2 |)A = (cj|A + <c 2 |A 

and 


«c|A)A = «c|A)A 

We thus see that A is a linear operator in bra-space as well as in ket-space, and we 
are justified in writing the expression (9.95) unambiguously as (c|A|a). We can 
consider the operator as acting either to the right or to the left, whichever is con- 
venient, emphasizing the symmetry of the dual spaces. 

The definition (9.65) of the Hermitian adjoint operator becomes, in bra-ket 
notation, 

(a|A t |c) = (c|A|a)* (9.96) 

Hence, the general correspondence 

\b) = A\a) (b\ = (<z|A + (9.97) 

is established. 

A Hermitian operator A is characterized in this notation by 

(a\A\b) = (b\A\a)* (9.98) 

An example of a Hermitian operator is the projection operator P a , defined by 
(9.49) and (9.50). In bra-ket notation, it is written as 

P a = W)(a\ (9.99) 

Using the projection operators for the basis vectors 

Pi = \Ki){Ki\ (9.100) 

a general state, as given by (9.91) and (9.92), can be expressed in terms of projection 
operators as 


\v) = 2 = 2 ^ 1 ^) 


(9.101) 
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and 

<*| = 2 <^|XiKKi| = 2 <*1?, (9.102) 

i i 

Since l^) is arbitrary, it follows that the projection operators P t satisfy the closure 
relation 

(9.103) 

The probability doctrine of quantum mechanics leads to the definition of the 
expectation value (K) of an observable K in the state j'P). If I'P) is normalized to 
unity: 

(K) = 2 KMKm 2 = X >*,<*, |*> (9-104) 

i i 

where K t are the possible results of measuring K. From this expression we infer that 
the observable K is represented by the Hermitian operator 

(9.105) 

No confusion is likely to arise from the use of the same symbol K for both the 
observable and the operator that represents it. 

With (9.105), the expectation value (9.104) of K becomes 

(&)'='(¥ I kI^TI (9-106) 

which is one of the fundamental formulas of quantum mechanics. 

Exercise 9.14. Prove that K defined by (9.105) is a Hermitian operator if all 
numbers K t are real. 

If l^) = |£}}, all probabilities |{£,|' v I r )| 2 for i + j are zero, and l(£)|\P)| 2 = 1. 
Thus, in the state |jK}> the observable K assumes the sharp and definite value Kj. 
The state ) Kj) is an eigenvector of the operator K, as 

K\Kj) = X \K i )K i (K i \K J ) = X \K,)K t S v = Kj\Kj) (9.107) 

i i 

and the sharp value Kj is an eigenvalue of K. The eigenvalue problem of Hermitian 
operators in quantum mechanics will receive detailed consideration in Chapter 10. 

Just as a state is specified by its components in a given basis, an operator is 
specified by its matrix elements. As in (9.55), for an operator A these are defined as 
the expansion coefficients in the equation: 

A\Kj) = X \Ki)(Ki\A\Kj) = X \KMtj (9.108) 

i i 

which follows from the identity (9.103). By comparison with (9.59), we deduce that 

= (%, A%) = (Ki\A\Kj) (9.109) 

The matrix representing the special Hermitian operator K, whose eigenvectors ac- 
cording to (9.107) make up the chosen basis, has a particularly simple structure, 
since 




tv\r\v\ = KA., 


(9.110) 
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This matrix is “diagonal,” i.e. all off-diagonal elements ( i =£ j) are zero, and the 
eigenvalues of K are the diagonal matrix elements. 

The deceptive ease with which it is possible to derive useful relations by em- 
ploying the identity (9.103) is exemplified by a calculation of the matrix elements 
of the product of two operators A and B : 

(K,\AB\Kj) = (. K,\A1 B\Kj ) = (K,\A 2 \K e )(K e \B\Kj) 

_ € (Q nn 

= 2 <^if I A | K e )(K ( \B\Kj) 
e 

which is the matrix element of the product of the two matrices representing A and 
B, as in Eq. (9.64). 

Exercise 9.15. For an arbitrary normalized state |a) and an operator A, cal- 
culate the sum 2 Ko|A|/C ; )| 2 over the entire basis | AT, ) . What value is obtained if 

i i 

A is unitary? 


5. Change of Basis. In introducing the notion of the complex vector space for 
quantum mechanical states, we are guided by the similarity between the geometry 
of this abstract vector space and geometry in ordinary Euclidean space. 

A representation in our vector space corresponds to the introduction of a co- 
ordinate system in Euclidean space. Just as we study rotations of coordinate systems 
in analytic geometry, we must now consider the transformation from one represen- 
tation to another in the general vector space. __ 

Along with an old basis 'P), used in (9.28), we consider a new basis, The 
new basis vectors may be expressed in terms of the old ones: 


= 2 %s ik 


The nonsingular matrix of the transformation coefficients 

/ Sn S\ 2 • • • S ln \ 


s = 


>21 U22 


\Snl S n: 


>2 n 


(9.112) 


defines the change of basis. 

A succession of two such basis changes, S and R, performed in this order, is 
equivalent to a single one whose matrix is simply the product matrix SR. 

To obtain the new components a k of an arbitrary vector, we write 

% = 2 = 2 

i k 


Substituting NP* from (9.112), we get 

«, = 2 S ik d k (9.113) 

* 


la A 


(a A 

a 2 


«2 

• 

= S 

• 

\a„) 


\dnl 


or 
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The inverse of this relation is 


j a l\ 


M 

a 2 

= S -1 

a 2 

• 

\a„l 


W/ 


(9.114) 


We must also determine the connection between the matrices A and A repre- 
senting the operator A in the old and new representations. Evidently, the new matrix 
elements are defined by 

A% = 2 %A 9 = E E VuAy 

i i k 

But on the other hand 


A% = A E VtS<, = E E %A M S ej 

e t k 

Comparing the right-hand sides of these equations, we obtain in matrix notation 

SA = AS 


or 


A = S~ l AS 


We say that A is obtained from A by a similarity transformation. 


(9.115) 


Exercise 9.16. If /(A, B, C , . . .) is any function that is obtained from the 
matrices, A, B, C , ... by algebraic processes involving numbers (but no other, con- 
stant, matrices), show that 

f(S~ 1 AS, S-'BS, S-'CS, ...) = S-'fiA, B, C , . . .)S (9.116) 

Give examples. 


So far in this section it has not yet been assumed that either the old or the new 
basis is orthonormal. If nonorthogonal basis vectors are used, the transformation 
coefficients can generally not be calculated from the formula 

S ik = (%,V k ) (9.117) 

but this relation does hold if the old basis is orthonormal. Nonorthogonal basis 
vectors were allowed in the Rayleigh-Ritz method of variation-perturbation theory 
in Section 8.4. 

If, as is usually the case, both the old and the new basis are orthonormal, 
the bra-ket notation is convenient. We identify with the unit ket | J5T,-) and ^ k 
with |L fc ). The orthonormality of both bases is expressed as 

(K i \Kj) = 8 ij , (L n \L k )=8 nk (9.118) 

and their closure as 

E l^-X^I = E \Lj){Lj\ - 1 (9.119) 

< j 

Multiplying (9.119) from the right by | L k ), we get 

\L k ) = E 


(9.120) 
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linking the two representations just as Eq. (9.112) does. Hence, for orthonormal 
bases, (9.117) holds and can be written in the form 

S ik = C K,\L k ) (9.121) 

From (9.120), by multiplication by (M r | from the left and subsequent complex con- 
jugation, we obtain 

* (Lj\V) = X <fn\KiW\^) (9-122) 

i 

expressing the new components of the state M* in terms of the old components, 
similar to (9.1 14). This last equation is nothing more than the composition rule (9.8) 
for probability amplitudes. 

For two orthonormal bases, the transformation coefficients must satisfy the con- 
ditions 

X S* k S i( = X <jLk\Ki)<Ki\L { ) = (L k \L e ) = 8* (9.123) 

i i 

and 

X S ik S* k = X (Ki\L k )(L k \K e ) = (K t \K e ) = 8 ie (9.124) 

k k 

Hence, S must be a unitary matrix 

SS f = S f S=l (9.125) 

where S denotes the transformation matrix with rows labeled by the eigenvalues of 
L and columns labeled by the eigenvalues of K. 

Using the unitarity condition, we may rewrite the similarity transformation 
equation (9.115) for a matrix representing the operator A in the form 

A = S f AS ' (9.126) 

or more explicitly, 

(Lj\A\L k ) = X (L J \K i ){K t \A\K e )(K t \L k ) (9.127) 

ie 

An alternative interpretation of a unitary transformation consists of regarding 
(9.114) not as a relation between different representations of the components of the 
same vector (“passive” transformation), but as defining the components of a new 
vector in a fixed representation (“active” transformation). Comparing (9.114) with 
(9.57), we see that if the matrix S _1 = S t can be considered as connecting the 
components of two vectors in the same basis, it represents a unitary operator U that 
changes every vector 'F into a vector IFP. The operator A which maps the unitary 
transform U y ¥ of into the unitary transform U A 1 ? of A '4'" is defined by the equation 

A(UW) = U(AV) 

Hence, we have the operator equation 

A = UAU* (9.128) 

which agrees with the matrix equation (9.126). The two “rotations,” one (passive) 
affecting only the basis, and the other (active) keeping the basis fixed while rotating 
all vectors and operators, are equivalent but they are performed in opposite “direc- 
tions”; that is, one is the inverse of the other. 
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Exercise 9.17. Show that under an active unitary transformation a Hermitian 
operator remains Hermitian, a unitary operator remains unitary, and a normal op- 
erator remains normal. Also show that a symmetric matrix does not in general remain 
symmetric under such a transformation. 

Exercise 9.18. Show by the use of the bra-ket notation that 

trace A = 2 <AT ( |A| AT,) (9.129) 

i 

is independent of the choice of the basis | K t ) and that 

trace AB = trace BA (9.130) 

Exercise 9.19. Show that 

2 2 |<£,'|A|L / >| 2 = trace AA f = trace A f A (9.131) 

‘ j 

and that this expression is independent of the bases |£)) and j L } ). 

Dirac introduced the bra-ket notation in order to make the formal expressions 
of quantum mechanics more transparent and easier to manipulate. We will see in 
Section 9.6 that it also makes it easy to unify the formalism for finite- and infi- 
nite-dimensional, but separable, vector spaces (n — > °°) with which quantum me- 
chanics works. The bra-ket notation is not particularly convenient when, as in Sec- 
tion 8.4, nonorthogonal bases are employed, because such bases are not 
self-reciprocal. Generally, components of vectors are then not simple inner products, 
the matrix representing a Hermitian operator may not be Hermitian, and unitary 
operators are not necessarily represented by unitary matrices. 

6. Hilbert Space and the Coordinate Representation. As explained at the end of 
Section 9.1, we have so far assumed the vector space to be of a finite number of 
dimensions, n, and thus any operator to have at most n distinct eigenvalues. Yet, in 
ordinary wave mechanics, which motivated our interest in the mathematics of linear 
vector spaces in the first place, most observables have infinitely many possible val- 
ues (their eigenvalues), and many of those, instead of being discrete, are indenu - 
merable and form a continuum. If n is allowed to grow to infinity, a number of 
important and difficult mathematical questions arise, to which we have alluded on 
several occasions. 

If the limit n — > °° is to have meaning for quantum mechanics with its proba- 
bility amplitudes and expectation values, we must work only with a mathematical 
structure that allows us to expand state vectors in terms of complete sets of orthog- 
onal vectors. This strategy was already employed in Sections 4.3 and 4.4 for dis- 
cussing the continuous spectrum of such commonplace observables as the position 
or momentum of a particle. In this section, we will recover the fundamentals of wave 
mechanics in the framework of a vector space that supports operators with eigen- 
value spectra consisting of continuous portions in addition to discrete points. The 
bra-ket notation helps to express the unified theory of finite- and infinite-dimensional 
vector spaces in compact form. 

The principles spelled out in Sections 9.1 and 9.4, demand that if the eigenvalue 
equation for the observable K, 


K\K') = K’\K') 


(9.132) 
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has a continuous spectrum, only those eigenvectors are admissible for which the 
eigenvalue K' is real and which can be normalized in terms of delta functions rather 
than Kronecker deltas: 

(K'\K") = 8(K' - K") (9.133) 

With this normalization, all the formulas for the discrete and continuous cases are 
similar, except that integrals in the continuous case replace sums in the discrete 
case. For continuously variable eigenvalues, the prime notation to distinguish ei- 
genvalues ( K ') from operators ( K ) has been reinstated. 

Thus, an arbitrary state vector can be expanded as 

|¥> - X | £■,)<£, |¥> + f | K')dK'(K'\V) (9.134) 

Ki J 

the sum being over the discrete and the integral over the continuous eigenvalues of 
the observable K. For simplicity, we assume that there are no repeated eigenvalues 
of K. It is easy to generalize the formalism to accommodate repeated eigenvalues, 
and this will be done in Section 10.4. It is even possible for a particular eigenvalue 
to belong to both the discrete and continuous spectrum. The corresponding eigenkets 
must be orthogonal. For an example from atomic physics, see Section 19.7. 

The expansion (9.134) gives the expectation value of K as 

(K) = <¥|tf|¥> - 2 KMKiin 2 + f K'\(K' |^)| 2 die (9.135) 
Ki J 

showing that | (K' | 1 F) | 2 dK' is the probability of finding for the observable K a value 
between K' and K' + dK' when K' lies in the continuous part of the spectrum. Thus, 
|<*:'|'P>| 2 is the probability per unit K' -interval, or a probability density. 

The application of the formalism to wave mechanics for a point particle in one 
dimension is straightforward. The three-dimensional theory is worked out in Section 
15.1. Since we can measure the particle’s position along the x axis, there is a Her- 
mitian operator x for this observable. The results of a measurement can be any real 
number between — °o and + 00 , and the eigenvalues of x, denoted here by x', form 
a continuum. The corresponding eigenvectors are written as |jc') 

x\x') = x'\x') (9.136) 

with the assumed normalization. The closure relation is now 

J \x')dx'(x'\ = 1 (9.137) 

and the state is expanded as 

|¥> - J (9.138) 

The components (jt'I'P) constitute a complex-valued function of the real variable 
x' . By identifying it with the wave function, 

jjcO = (9.139) 

we establish the connection between the state vector \ 'F) and the wave function 
il/(x'). The coordinate jc' serves as the continuously variable label of the component 
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of the state vector IM*) in an infinite-dimensional abstract vector space. From 
this point of view, ') is merely one of many possible ways of representing the 
state vector. The representation used here is spanned by the eigenvectors of the 
position operator jc and is called the coordinate representation. 

The inner product of two states is represented by 

OPal^i) = || (V 2 \x")dx"(x"\x')dx'(x'\V l ) 

= || (V 2 \x") dx" 8(x" - x') dx'ix' |¥,> (9.140) 

= |_ '/'ICO^iC*') dx' 

The inner product of two orthogonal states is zero, in agreement with the earlier 
conventions (Section 4.1). 

In the coordinate representation, the matrix that represents an operator A is now 
a matrix whose indices, instead of being denumerable, take on a continuum of values. 
Although such a matrix can no longer be written down as a square array of numbers, 
it is nevertheless appropriate to use the term matrix element for the quantity 
(x"|A|x'). A function fix) is represented by the matrix element 

(x"|/(x)|x'> = fix')Six' - x") (9.141), 

This is said to be diagonal, since it vanishes for x" # x ' . 

Linear momentum in the x direction is another important operator for the sys- 
tem. We know that in the coordinate representation it acts on wave functions as 
ih/i ) d/dx ' , and we can use this information to conjecture that in the abstract vector 
space it is a Hermitian operator, p, which satisfies the commutation relation 

xp — px — ih 1 (9.142) 

For brevity we omit the subscript x on the momentum p, because we are dealing 
with a one-dimensional system. The fundamental relationship between linear mo- 
mentum and linear displacements, which is at the root of (9.142), was already dis- 
cussed in Section 4.5 and will again be taken up in Section 15.1. Here we merely 
assume the commutation relation (9.142) and deduce its consequences. 

Exercise 9.20. Taking the trace on the two sides of the commutation relation 
(9.142), show that a contradiction arises from the application of (9.130). Resolve 
the paradox. 

The matrix element of the operator equation (9.142), taken between the bra (x"| 
and the ket |x'>, is 

{x"\xp — px\x') = ix" — x')(x"|p|x') = ihSix" — x') 
from which we infer that 

<x"|p|x') = ih ^ = ih —7 Six" -x') = tA S(x" ~ *') (9.143) 

x - x dx i dx 


which, though highly singular, is a well-defined quantity. 
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Exercise 9.21. Show that for any function /(/?) that can be expressed as a 
power series of its argument, 

<*"l/(p)|*'> = f(j - *’) (9-144) 

It follows that the action of an operator f(p) on a state is represented as 

& 

/(P)|*> = 


and that 

0^2 I/O) I ^i> = J f(~ ^jfpi(x’)dx' (9.146) 

All of these results confirm that the identification of the operator p which satisfies 
the commutation relation (9.142) as the momentum, is correct. More generally, they 
reassure us that the abstract vector space (bra-ket) formulation of quantum mechan- 
ics contains wave mechanics as a representation. 


JJ \x’)dx' {x' \f(p)\x")dx"{x"\'¥) 

jj | x')dx' /( j ^«(x' - x")x Kx")dx" (9.145) , 

J | x')dx' /(y -^\ijf{x')dx' 


Exercise 9.22. Change from the coordinate basis to the momentum basis, 
showing that the transformation coefficients are 


(x'\ P r ) 


1 

s/lirh 




Represent states and operators in the momentum basis. Derive the equations con- 
necting the expressions for components of states and matrix elements of operators 
in the coordinate and momentum representations. 


As an illustration, we reconsider the linear harmonic oscillator with its Ham- 
iltonian or energy operator, 


p 2 mo/ x 2 
H = ~ + — - — 
2m 2 


(9.147) 


Denoting the eigenvalues of H by E as is customary (instead of H '), we see that the 
eigenvalue equation for the energy operator, 

H\E) = E\E) (9.148) 


appears in the coordinate representation as 

(p 2 + m 2 o/x 2 )\E ) = r— f \x')( -h 2 -^j)ijj E (x')dx' + m 2 o/ f \x')x' 2 4/ E (x')dx' 
2m 2m J \ dx j J 

= E J \x')dx’iff E (x’) 
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where i// £ (x') = (x'| E). Since all the vectors |x') are linearly independent, this 
equation requires that for the components of |£), labeled by x' , 


h 2 d 2 moo 2 

2m dx' 2 2 



Mx') = 0 


(9.149) 


which is nothing but the Schrodinger equation for the harmonic oscillator. Only 
those solutions are admissible for which E is real and the normalization 


(E 2 \E x ) = j = 8 12 or 8(E 1 - E 2 ) 

can be achieved. For an equation like (9.149), this condition is equivalent to 

«M+°°) = «M~°°) = 0 (9.150) 


From Chapter 5 we know that only discrete eigenvalues, 


E n = hoo\ n H — 

V 2 1 


(n = 0,1,2 ,.. .) 


exist for this Schrodinger equation and that, by (5.39): 

. , , , 1 1 / moo\ ( maox' . 

u |S„> = ) = __ f_l expl-— ltf„ 


,' 2 > 



(9.151) 


Exploiting the freedom to choose a representation, we may use the eigenvectors 
of H as the basis of a representation, which we designate as the (harmonic oscillator) 
energy representation. It is spanned by a denumerable infinity of basis vectors, la- 
beled by the quantum number n. Although it is qualitatively different from either 
the coordinate or the momentum representation, any of these three representations 
can be equivalently used to expand an arbitrary vector of the same Hilbert space. 
The transformation coefficients (x'\E n ) are subject to the unitarity conditions 


{E k \x')dx'{x'\E n ) = 8 kni (9.152) 

CO 

S <*'|£„><£„l*"> = S(X' - x") (9.153) 

n=0 


These conditions are satisfied by virtue of the orthonormality and completeness of 
the eigenfunctions (9.151). 


Problems 

1. If iA„(r) is the normalized eigenfunction of the time-independent Schrodinger equa- 
tion, corresponding to energy eigenvalue E n , show that | if/ n ( r)| 2 is not only the prob- 
ability density for the coordinate vector r, if the system has energy E n , but also 
conversely the probability of finding the energy to be E n , if the system is known to 
be at position r. 

2. Using the momentum representation, calculate the bound-state energy eigenvalue and 
the corresponding eigenfunction for the potential F(x) = — g8(x) (for g > 0). Compare 
with the results in Section 6.4. 



CHAPTER 10 


Eigenvalues and Eigenvectors of 

Operators, the Uncertainty Relations, 
and the Harmonic Oscillator 

A thorough understanding of the eigenvalue problem of physically 
relevant operators and of the corresponding eigenvectors (eigenstates) is 
essential in quantum mechanics. In this chapter, we examine some further 
ramifications of this problem. The physical significance of commutation 
relations will be discussed and illustrated by the Heisenberg uncertainty 
relations. The chapter concludes with a return to the harmonic oscillator, 
now in terms of raising (creation) and lowering (annihilation) operators, 
preparing for applications in many-particle theory. Coherent oscillator 
states shed light on the connection with classical mechanics and are 
central to the interpretation of processes in quantum optics. Squeezed (or 
stretched) oscillator states make it possible to give an introduction to the 
concept of quasiparticles. 


1. The Eigenvalue Problem for Normal Operators. A ket |A •) is called an 
eigenvector, or eigenket, of the operator A if 

a|a;> = a;|a;> (io.i) 

The number A' which characterizes the eigenvector is called an eigenvalue. The 
effect of A on |AJ) is merely multiplication by a number. We first give our attention 
to the eigenvalue problem for normal operators, which include Hermitian and unitary 
operators. In Section 10.7 the discussion will be extended to an important nonnormal 
operator, the annihilation operator. Different eigenvalues will be distinguished by 
their subscripts. An eigenvalue enclosed in a ket | ), as in |A'), names the eigenket 
belonging to the eigenvalue A'. 

Our main objective will be to prove that, at least in a finite-dimensional vector 
space, every normal operator has a complete set of orthonormal eigenvectors, which 
may be used as basis vectors spanning the space. The normal operator A may have 
repeated eigenvalues. By this we mean the occurrence of more than one linearly 
independent eigenvector belonging to the same eigenvalue. (When this happens for 
the Hamiltonian operator of a system, we speak of degenerate energy eigenvalues.) 

Since any linear combination of eigenvectors belonging to the same eigenvalue 
is again an eigenvector belonging to the same eigenvalue, these eigenvectors form 
a subspace of the original vector space. The original linearly independent eigenvec- 
tors in this subspace may be replaced by an equal number of orthogonal eigenvec- 
tors, all of which correspond to the same eigenvalue. This is accomplished by a 
successive orthogonalization algorithm ( Schmidt orthogonalization method), the es- 
sence of which was already described in Section 4.1 and illustrated in Figure 4.1. 
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If M'j and M^2 are two nonorthogonal eigenvectors with repeated eigenvalues, we 
construct as a replacement for the eigenvector 


= M/ 1 2 


^2) = ^ 
(%, ^i) 




( 10 . 2 ) 


where is the projection operator in the “direction” of , 'P' 1 . The new eigenvector 
^2 belongs to the same eigenvalue as 'Fi and but it is orthogonal to MV 


(%, ^2) = 0 


If there is a third linearly independent eigenvector, \F 3 , belonging to the same re- 
peated eigenvalue, we replace it by 

¥3 = [1 - Peru ~ Pc*»W 3 ( 10 . 3 ) 

which is orthogonal to both % and MV This procedure is continued until a complete 
orthonormal basis has been constructed for the subspace of eigenvectors belonging 
to the repeated eigenvalue. In the bra-ket notation, the occurrence of repeated 
eigenvalues requires that the name of an eigenket be specified by additional labels 
to supplement the information conveyed by the common eigenvalue A J . 


Exercise 10.1. Show that M^ in (10.3) is orthogonal to both M^ and MV 


For a normal operator, for which by definition 

[A, A f ] = 0 


(10.4) 


we infer from (10.1) that 

A + A|A-> - AA f \A' t ) = A', A f |A •) (10.5). 

Hence, A f |A •) is an eigenket of A, belonging to the eigenvalue A ■ and may be written 
as 

A f |AJ> = a|A5> + |/3> . (10.6) 

where |/3) is also an eigenvector of A with eigenvalue AJ- . We may assume that \j3) 
is orthogonal to | A-); i.e., (Aj|jS) = 0. From (10.1) we have also 

(A - |A f = A -*(A • | (10.7) 

as well as 

<^| A t = (10.8) 

If we multiply (10,6) on the left by (A'| we obtain, using (10.7), 

a;*<a;|a;> = q:(a;|a;> or a = a;* 

If we substitute this result in (10.6) and then multiply on the left by (j8|, we find, 
using (10.8), that </3|/3) = 0 and that thus for all eigenvectors, 

A f |A ■) = Ai*|A-> (10.9) 

as well as 

<a;|a = A’M\\ (10.10) 

From Eqs. (10.1) and (10.10) we obtain 

(M - Aj){A'i\A'j) = 0 


( 10 . 11 ) 
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showing that any two eigenkets belonging to different eigenvalues of a normal op- 
erator are orthogonal. 

We thus conclude that all linearly independent eigenvectors of a normal oper- 
ator may be assumed to be orthogonal. It is convenient to take these eigenvectors 
also as being normalized to unity. 

It may be useful to recapitulate here that the eigenvalue problem for a normal 
operator can be expressed in any one of four equivalent ways: 


A\A’ t ) =A' i \A\) 
(A’j |A f = A’^jA’i 


A^A’i) = a;*|aj> 

(Aj\A =A\<A\ 1 


Note that the last two relations follow from the first two, or vice versa, only for 
normal operators. 

2. The Calculation of Eigenvalues and the Construction of Eigenvectors. Al- 
though we have demonstrated the orthogonality of the eigenvectors of normal op- 
erators, we have not established that any solutions to the eigenvalue problem (10.1) 
actually exist, nor have we yet found a method for calculating the eigenvalues of A 
and for determining the corresponding eigenvectors. Our hope is not only that so- 
lutions exist, but that in an n-dimensional vector space there are n orthogonal 
eigenvectors so that a complete basis can be constructed from them. We simulta- 
neously attack both problems — the existence of solutions of (10.1) and the program 
for obtaining them. 

If a basis is introduced in the space, the representation of the eigenvalue prob- 
lem (10.1) takes the form of a matrix equation: 


^ An 

a 12 

A\„ ^ 

f Xi\ 


lxi\ 

A 2 i 

A 22 

A 2n 

x 2 

= A 

x 2 

■ 

\A„ i 

A n 2 

A nn 1 

\Xn) 


\x«l 


The matrix elements x u x 2 , . . . , x„ are the components of the eigenvector which 
belongs to the eigenvalue A. Equation (10.12) is a set of n linear homogeneous 
equations that possess nontrivial solutions only if 

A n — A A 12 

A 2 i A 22 ~ A 

A„ i A n 2 

or 

det(Ay - A 8y_ 

This equation of the nth degree in the unknown A is called the secular or charac- 
teristic equation. The roots of (10.13), A = A', are the eigenvalues of A. 



- 0 (10.13) 
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According to the theorems of linear algebra, the existence of at least one so- 
lution of the set of homogeneous equations is assured. We may thus substitute A = 
A [ in (10.12) and solve the set of n homogeneous equations to obtain an eigenvector 
| A[). We then change to a basis that includes |AJ) as one of its elements. Because 
of the properties, 

A|AJ) = A[\A[) and <A{|A=A«AJ| 


valid for a normal operator, the normal matrix representing the operator A in this 
new representation takes the partially diagonalized form: 


Mi 

0 

0 

... _0 

0 

A22 

A23 

* ■ ■ a 2 „ 

0 

A32 

A3 3 

■ • ■ A 3n 

\° 

A„2 

* 

A„„ j 


The n— 1 dimensional matrix of the matrix elements Ay is again a normal matrix. Its 
eigenvalue problem has the same solutions as the original problem except for an 
eigenvector belonging to the eigenvalue A[. The same procedure as before can then 
be continued in the n— 1 dimensional subspace, which is orthonormal to | A i ). After, 
n— 1 such steps, the matrix representing A will result in the completely diagonal 
form 


IA[ 0 0 0\ 

0 A'i 0 ■ • ■ 0 

0 0 A3 • • • 0 

\0 0 0 -I’ A'} 

and the ultimately obtained basis vectors are the eigenvectors of A. This procedure, 
which is equally applicable whether or not there are repeated eigenvalues, proves 
that for a normal operator n orthonormal eigenvectors can always be found. If the 
operator is Hermitian, all expectation values and eigenvalues are real. If the operator 
is unitary, the eigenvalues have modulus unity. 


Exercise 10.2. Prove the converse proposition that an operator A whose 
eigenvectors span a complete orthonormal basis must commute with its adjoint, i.e., 
be normal. 

Exercise 10.3. Prove that the eigenvalues U\ of a unitary operator have the 
property 

\U'i\ 2 = 1 (10.14) 

In the bra-ket notation, the eigenvalue problem for the normal operator A ap- 
pears in the form 

X (KtlAlKjXKjlA't) = A'MA' t ) 


(10.15) 
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As we have seen, the eigenvectors may all be chosen to be orthonormal: 

{A'i\A[) = 8 { (10.16) 

and they form a complete set: 

n n 

X |A;><A' f | = 2 P, = 1 (10.17) 

i= 1 i= 1 

where P, = |A'-)(A‘- 1 is the projection operator for the eigenvector |A •). (A reminder: 
If an eigenvalue is repeated, an additional label is required to characterize the 
eigenvectors. More about this is detailed in Section 10.4.) 

The transformation matrix S with matrix elements 

S u = (K t \A’ e ) (10.18) 

changes the original basis | AT ; ) into one spanned by the eigenvectors of A, and it is 
unitary. The resulting form of the matrix representing A is diagonal, as is seen 
explicitly in the relation obtained from (10.15): 

2 (A' n IJC^IAI^X^IAJ) = A' ( 8 in (10.19) 

ij 

Either (10.15) or (10.19) may be used to determine the components of the 
eigenvectors of A. Quite different algebraic methods for determining eigenvalues 
and eigenvectors of certain privileged operators will be discussed later in this 
chapter. 

Although a particular representation was introduced to calculate the eigenvalues 
of A as roots of (10.13), the eigenvalues are properties of the operator and cannot 
depend on the choice of basis. Indeed, if we choose some other basis, linked to the 
previous one by the similarity transformation (9.115), 

A = S _I AS 

the new characteristic equation is 

det(A - Al) = det[S"‘(A - A 1)S] = det(A - Ai) = 0 

In this proof the property of determinants 

det AB = det A det B (10.20) 

has been used. Hence, the eigenvalues of A as defined by (10.13) are independent 
of the representation. Consequently, if we expand the characteristic equation ex- 
plicitly, 

(— A) n + (trace A)(— A)" -1 + • • ■ + det A = 0 (10.21) 

the coefficient of each power of A must be independent of the choice of represen- 
tation. 

It is easy to prove these properties for the trace and the determinant of A di- 
rectly. Since for finite dimensional matrices we know from (9.130), 

trace AB = trace BA (10.22) 

it follows that 

trace (ABC) — trac e(CAB) — trac e(BCA) (10.23) 

hence, 

trace A = trace(S~ 1 AS r ) = trace A (10.24) 
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Similarly, using (10.20), 

det A = det(S _1 AS) = det A (10.25) 

It is therefore legitimate to consider trace A and det A to be properties of the operator 
A itself and to attach a representation-independent meaning to them. 

Furthermore, from the diagonalized form of the normal operator A we see that 

trace A = A[ + A' 2 + ■ • • + A’ n = sum of the eigenvalues of A (10.26) 

and 


det A = A[ X A' 2 X • • • X A' = product of the eigenvalues of A (10.27) 
As an application, consider the matrix e A defined as 

( \ ^ 

1 + - ) (10.28) 

NJ 

The eigenvalues of e A are e A '‘. Hence, we have the useful relation 


det e A = [I e Ai 

i 


exp | 




= exp(trace A) 


(10.29) 


Exercise 10.4. Prove (10.29) directly from (10.28) without recourse to the 
eigenvalues. 


If f(z ) is an analytic function, the function B = /(A) of the normal operator A 
can be expanded in finite powers of A very simply. From the completeness relation 
(10.17) we see that 

n 

f(A) = 2 /(A ')/>,. (10.30) 

i= 1 


provided that the singularities of f(z) do not coincide with any eigenvalue of A. If 
n ' of the n eigenvalues of A are distinct, we may label these nonrepeating eigen- 
values by the subscripts i = and express (10.30) in the form 


n r n' 

m = 2 W) n 

J-l 1=1 

a*j) 


A\1 - A 

a; - a;. 


(10.31) 


showing that any function f(A) can be written as a polynomial in A of degree less 
than n. 


Exercise 10.5. Prove that a normal rc-dimensional matrix A satisfies its own 
characteristic equation (10.21), and show that A" can be expanded as a polynomial 
in A of order less than n. (The Cayley -Hamilton theorem states that this is true for 
any matrix.) 

3 . Variational Formulation of the Eigenvalue Problem for a Bounded Hermitian 
Operator. In the last section, we treated the eigenvalue problem for a normal op- 
erator as a problem in linear algebra. The characteristic equation provides a means 
of calculating eigenvalues to any desired approximation, but the task can be prohib- 
itively complicated if the dimensionality n is very large. In this section, we take a 
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different tack and assume that A, rather than being a general normal operator, is 
Hermitian. This assumption covers all observables, especially Hamiltonians. In Sec- 
tion 8.1 the variational method was already introduced as a useful tool for estimating 
the low-lying eigenvalues of the Hamiltonian operator. The Rayleigh-Ritz method 
described in Chapter 8 links the two approaches and takes advantage of the varia- 
tional principle to justify the use of an approximate characteristic equation of lower 
dimensionality. 

We employ a 1 variational principle by defining a real-valued functional 

A[^] = ao.32) 

mm 

and look for the particular ''P which minimizes (or maximizes) Aim* By dividing 
(■tp, A^) by OP, m we have made A independent of the “length” of ''P and depen- 
dent only on its “direction.” We note immediately that if 'P is an eigenvector of A, 
such that 


AT' = A OP 


then A = A\. Suppose that A has a greatest lower bound A 0 , which it assumes for 
the vector m : 


Ami = A 0 


QPo, Af 0 ) 

cm, m) 


(A 2= Ao) 


Let us calculate A for ^P = m ± ed>, where e is a small positive number and is 
an arbitrary vector. Since A 0 is the greatest lower bound, we have 

Atm ± e0>] > A 0 


Upon substitution, we obtain the result 

±e[(<&, (A - A 0 >m) + OPo, (A - A 0 )(D) ±e(<D, (A - A 0 )$)] s= 0 


Since Q = (d>, (A - A 0 )d>) > 0, we find by applying the above inequality that 

~sQ < (0>, (A - AoOPo) + CP, (A - A 0 )d> 0 ) ^ sQ 

Now let e -> 0. Then 

($>, (A - AoOPo) + (<&, (A - AoOPo)* = 0 

owing to the Hermitian nature of A. Since d> is arbitrary, we may choose it equal to 
(A — Aoim and thus conclude 

((A - AoOPo, (A - A 0 )>P 0 ) = 0 


which implies that 


Am = Ao'P 


o 


Thus, a vector m that makes A of (10.32) a minimum is an eigenvector of A, and 
A 0 is the corresponding eigenvalue. Evidently, it must be the least of all eigenvalues, 
allowing the identification A 0 = Ao if Aq ^ A{ ^ A' 2 . . . . 

We now consider a new variational problem and construct 


a m = 


op, Am 
op, m 
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where M* is the orthocomplement of 'TV 


\j/ = _ 


'PoOPq, 

(^ 0 , %) 


As 'P ranges through the n-dimensional space, 'F runs through the entire subspace 
of n- 1 dimensions, orthogonal to ^ 0 . The same argument as before gives for the 
minimum ju, 0 of p: 


A<P = p 0 V 0 


and M r must be the eigenvector belonging to the second lowest eigenvalue, 
p 0 - A[. In this manner we may continue, and we will eventually exhaust the entire 
n-dimensional space after n steps. Hence, there are n orthogonal eigenvectors. 

While the variational proof of the existence of eigenvalues as given here is 
limited to Hermitian operators, it has the merit of avoiding the explicit use of a 
representation. Also, since it makes no essential use of the assumption that n is 
finite, it can be generalized to the case n -» 00 . The generalization requires only that 
the operator A be bounded at least from one side. The operator A is said to be 
bounded if A as defined in (10.32) is bounded: + °° > A > — °°. From the Schwarz 
inequality, it follows that boundedness of A is assured if, for a given A, there exists 
a positive number C, independent of 'F, such that 


(A^, A'F) < COF, 'F) 


for every ''F. Many operators common in quantum mechanics, such as the energy, 
have only a lower bound. 


4 . Commuting Observables and Simultaneous Measurements. The physical 
meaning of Hermitian operators as candidates for representing observables motivates 
us to use basis vectors that are eigenvectors of Hermitian operators. Ideally, we 
would like to identify all basis vectors by the eigenvalues of the observable that 
supports a particular basis, and the bra-ket notation was designed with that objective 
in mind. 

Because of the occurrence of repeated eigenvalues, the eigenvalues of a single 
observable A are usually not enough to characterize a basis unambiguously, and we 
must resort to additional labels or “quantum numbers” to distinguish from each 
other the different orthonormal basis vectors that correspond to a particular eigen- 
value A'. The presence of repeated eigenvalues for a physical observable, selected 
because of its relevance to the system under study, can usually be attributed to some 
symmetry properties of the system. The example of the Hamiltonian of the free 
particle in three-dimensional space in Section 4.4 is a case in point. Owing to the 
translational symmetry of the Hamiltonian, the energy eigenvalue E > 0 alone does 
not suffice to identify an energy eigenstate. 

We look for additional observables that share common or simultaneous eigen- 
vectors with the operator A, but that are sufficiently different from A so that their 
eigenvalues can serve as distinguishing indices for the eigenvectors belonging to 
repeated eigenvalues of A. For the free particle Hamiltonian, the direction of linear 
momentum can serve as an observable that complements the characterization of the 
degenerate energy eigenstates, but other choices are possible. In the next chapter we 
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will see that for any system with rotational symmetry, of which the free particle is 
a special case, the angular momentum is the additional observable of choice. 

Confining our attention first to a single operator B in addition to A, we ask 
under what conditions two observables A and B possess a complete set of common 
eigenvectors. Such eigenvectors would then represent states in which definite sharp 
values A J- and B j can be assigned simultaneously to two observables. In an often- 
used but opaque terminology, A and B are said to be simultaneously measurable or 
compatible observables. 

Mathematically, we require that there be a complete set of states | Afif) such 

that 


a|a;b;> = aWa'iB'j) (10.33) 

and 

= B^A'iB'j) (10.34) 

both hold. If (10.33) is multiplied by B and (10.34) by A, we obtain by subtraction 

(AB ~ BA)\A'iB'j) = 0 

If this is to be true for all members of the complete set, AB — BA must be equal to 
the null operator, or 


[A, 5] = AB — BA = 0 


(10.35) 


Hence, a necessary condition for two observables to be simultaneously measurable 
is that they commute. 

We emphasize that the commutivity is a necessary condition only if all the 
eigenvectors of A are also to be eigenvectors of B. A limited number of eigenvectors 
may be common to both A and B even if the two operators do not commute. (An 
example is the state of zero angular momentum, which in Chapter 1 1 will be shown 
to be a common eigenstate of the noncommuting operators L x , L y , and L z .) 

The commutation relation (10.35) for A and B is not only necessary, but it is 
also sufficient for the existence of a common set of eigenvectors for the two oper- 
ators. To show this, we consider a particular repeated eigenvalue A J of A and its 
associated eigenvectors, which we assume to be r in number. We denote the corre- 
sponding eigenvectors by |A{, k), letting k serve as an index k = l ... r, which 
differentiates between the r orthogonal eigenvectors belonging to the same repeated 
eigenvalue A •, 

a|a;, k) = a;|a;-, k) qo.36) 

If B commutes with A, it follows that 

BA\A'i, k ) = AB\A\, k ) = A;-5|A;, k> 

This equation shows that B\A' iy k) is also an eigenvector of A with eigenvalue A\\ 
hence, 

r 

B\A’ iy k) = X |AJ, \)(A'i, A|5|A;., k> (10.37) 

A=1 


Exercise 10.6. Show that if A and B commute, B has no nonvanishing matrix 
element between eigenstates corresponding to different eigenvalues of A. 




216 


Chapter 10 Eigenvalues and Eigenvectors of Operators 


In the r-dimensional subspace of the eigenvectors belonging to A'- a change of 
basis may now be effected in order to construct a new set of eigenvectors of A, 
which are simultaneously also eigenvectors of B. They are the kets designated as 


r 


I A'iB'j) = 2 \A’ t , k)S kJ 

K=l 


(10.38) 


Here the coefficients S Kj are defined by the conditions 

r r 

B\ A'iB'j) = 22 |a;, a)<a;., A|5|a;., k)S kJ 

K= 1 A = 1 r r 

= B'jlA'iB'j) = B'j 2 2 Mka|a;-, a> 


(10.39) 


K— 1 A=1 


From (10.39) we obtain r linear homogeneous equations for the r unknown trans- 
formation coefficients S Kj : 

r 

2 ((A'i, A|B|A', k) - B'j8 XK )S kj = 0 (; fixed, k = 1, . . . , r) (10.40) 


A = 1 


This system of equations possesses nontrivial solutions only if the determinant of 
the coefficients vanishes: 


det((A ■, X\B\A'i, k) - B'j8 Xk ) = 0 (10.41) 

As in Section 10.2, the r roots of this characteristic equation give us the eigenvalues 
B[, B' 2 , . . . , B' r . Equations (10.40) can then be used to calculate the transformation 
coefficients S Kj . The new vectors | A'Bj) are the required simultaneous eigenvectors 
of both operators A and B. 

If among the r eigenvalues Bj (j — 1, . . . , r) there remain any that are repeated, 
then a further index may be used to distinguish those eigenvectors that have the 
same values A - and Bj in common. One then continues the procedure of choosing 
additional Hermitian operators C, D, ... , all commuting with A and B as well as 
with each other. If the choice is made intelligently, it will eventually be possible to 
characterize all n basis vectors in the space by addresses composed of sets of 
eigenvalues A • Bj, C' k , ... . 

If we can find a set of commuting Hermitian operators A, B, C, . . . , whose 
n common eigenvectors can be characterized completely by the eigenvalues A •, Bj, 
C' k , . . . , such that no two eigenvectors have exactly identical eigenvalue addresses, 
this set of operators is said to be complete. We assume that 

{A'i B'j C' k ... (a; B' q C' s . . .) — 8 ip 8 jq 8 ks ... (10.42) 

but often we write for this simply the compact orthonormality condition 

(K t \K m ) = 8 (m (10.43) 

The operator K here symbolizes the complete set A, B, C, . . . , and K e (omitting the 
prime for simplicity) is a symbol for the set of eigenvalues A'-, Bj, C' k , .... In 
particular, K e = K m means that in (10.42) each pair of eigenvalues satisfies the 

equalities: A • = A' p , B'j = B' q , C' k - C' s In the rare case that all eigenvalues of 

A are simple (no repeated eigenvalues), A alone constitutes the complete set of 
operators K — A. This simple situation was implicitly assumed to hold when we 
introduced bras and kets in Section 9.4. 
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In letting K symbolize the entire complete set of commuting operators, care 
must be taken to interpret sums over all sets of eigenvalues of K properly. For 
example, the completeness of the eigenvectors is expressed as the closure relation 

2^ = 2 \K,)(K e \ = 1 (10.44) 

e ( 

where 

s P e = \K e )(K e \ = |A' ( B'j C k . . Bj C' k ...\ (10.45) 

is the projection operator for K e . These equations can be used to reformulate the 
eigenvalue problem for one of the operators in the set symbolized by K. For example, 
we have 

A\K e ) =A' e \K { ) 

and hence, using the closure relation: 

A = 2 A I K e )(K t | = 2 A ;• I K e )(K t | (10.46) 

e e 

The eigenvalue problem for A can thus be expressed as follows: 

Given a Hermitian (or more generally, a normal ) operator A, decompose the 
space into a complete set of orthonormal vectors |/sf £ ) such that A is a linear 
combination of the projection operators \ K e )(K ( \. The coefficients in this ex- 
pansion are the eigenvalues of A. 

If the partial sum of all projection operators, which correspond 
value A'i, is denoted by 

P*i= 2 \K,){K,\ 

K^A'i 

we may write 

A = 2 AI-Fa;., 2 P A'i = 1 

i i 

The sums in (10.48) extend over all distinct eigenvalues of A. Equations (10.48) 
define what is called the spectral decomposition of the Hermitian operator A. This 
form of the problem is convenient because the operators P A • are unique, whereas the 
eigenvectors belonging to a repeated eigenvalue A ■ contain an element of arbitrar- 
iness. A one-dimensional projection operator like (10.45) is said to have “rank one” 
to distinguish it from higher rank projection operators like (10.47). 

For a function f(A) we may write 

/(A) = 2 f(A'i)\K t )(K e \ = 2 m)P A , (10.49) 

€ i 

Since this sum extends only over distinct eigenvalues of A, this is the same equation 
as (10.3 1). 1 

5. The Heisenberg Uncertainty Relations. We have seen that only commuting 
observables can in principle be measured and specified with perfect precision si- 
multaneously. If A and B are two Hermitian operators that do not commute, the 
physical quantities A and B cannot both be sharply defined simultaneously. This 


to the same eigen- 

(10.47) 

(10.48) 


For a discussion of functions of operators and matrices, see Merzbacher (1968). 
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suggests that the degree to which the commutator [A, B ] = AB - BA of A and B is 
different from zero may give us information about the inevitable lack of precision 
in simultaneously specifying both of these observables. We define 


AB - BA = iC 


(10.50) 


The imaginary unit has been introduced to ensure that C is a Hermitian operator. 


Exercise 10.7. Prove that C is an Hermitian operator. 


The uncertainty A A in an observable A was first introduced qualitatively and 
approximately in Section 2.2. To make it precise, we now define the uncertainty A A 
in A as the positive square root of the variance. 


and similarly, for B, 

(AA) 2 = ((A — (A)) 2 ) - (A 2 ) - (A) 2 

(10.51) 


(AB) 2 = ((B-(B)) 2 ) = (B 2 ) - (B) 2 

(10.52) 

For these quantities we 

will prove that 



(AA)\AB) 2 > Q <C)J 

(10.53) 


or 


AAA2?>i|<0| 


(10.54) 


Proof. Since A is Hermitian, we can write 

(AA) 2 = ((A-(A)m (A-(A)m 
if 'P is the state vector of the system. Similarly, 

(AB) 2 = ((B-(B))V, (B-(B))V) 

We now apply the Schwarz inequality (9.75) by making the identifications 

Tr a = (A-(A)TP and % = (B~(B))T 

and get 

(AA) 2 (AB) 2 > I CP, (A-(A))(5-(B»?)| 2 
The equality sign holds if and only if 

(B-(B))V = A(A— (A)pP 

Now we use the simple identity, based on the decomposition (9.84), 

(A— <A»0M*» = <*-<*>*»-<»» + (B-WXA-W) + L c 

2 2 


(10.55) 

(10.56) 


2 


(10.57) 
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by which the operator on the left is written as a linear combination of two Hermitian 
operators F and C. Since their expectation values are real, we can write (10.55) as 

(AA) 2 (A5) 2 > (F) 2 + ^ (C) 2 > ~ <C> 2 

which proves the theorem (10.53). The last equality holds if and only if 

(F) = 0 (10.58) 

It is of interest to study the particular state 'P for which (10.53) becomes an 
equality: 


AAAfl = 1 1 <0 1 

Such a state obeys the conditions (10.56) and (10.58). From (10.56) we can obtain 
two simple relations: 

OP, (A — (A))(5— (B))^P) = A(A A) 2 

OP, ( B-(B))(A-(A)m = ~ (A Bf 

A 

Adding the left-hand sides yields 2(F); hence by (10.58) 

A(A A) 2 + { (A Bf = 0 (10.59) 

A 


Subtracting the left-hand sides gives /(C); hence 


A(A A) 2 - \ (A Bf = i(C ) 
A 


From (10.59) and (10.60) we obtain 


A = 


/(C) 


2(A A) 2 

As a special case, let A — x, B = p x . Then, as in (3.47), 

XPx ~ Px x — till 


(10.60) 


(10.61) 


(10.62) 


and hence C=hl. The right-hand side of (10.54) is independent of the state 'P in 
this case, and we conclude that for all states 


A A ^ 

AxA Px > - 


(10.63) 


making specific and quantitative the somewhat vague statements of Chapter 2 and 
elsewhere. The Heisenberg uncertainty relation is thus seen to be a direct conse- 
quence of the noncommutivity of the position and momentum operators. 

Applied to A = x, B = p x , in the coordinate representation the equation (10.56) 
becomes a differential equation for the wave function i ft representing the special 
states 'P that make (10.63) an equality: 

AxA p x = 1 h 


(10.64) 
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The equation for t/f is 


T“- (Px) )<A = ^777^ C* - 


'ft 

, i dx 


2(Ajc ) 2 


and has the normalized solution 

</<x) = [2 it (Ax) 2 ] - 1/4 exp 


(x ~ (x)) 2 i{p x )x 


4(Ax) 2 


ft 


(10.65) 


( 10 . 66 ) 


Known somewhat imprecisely as a minimum uncertainty wave packet, this state 
represents a plane wave that is modulated by a Gaussian amplitude function. Since 
A is imaginary, the expression (10.66) is, according to (10.56), an eigenfunction of 
the non-Hermitian operator p x — Ax. As such, the state represented by (10.66) is 
known as a coherent state, with properties that will be discussed in Section 10.7. 


Exercise 10.8. Relate Ax to the mass and frequency of the harmonic oscillator, 
of which (10.66), with particular values of (x) and (p x ), is the ground state wave 
function. 


6. The Harmonic Oscillator. Although the harmonic oscillator has been dis- 
cussed in detail in Chapter 5, it is instructive to return to it and to treat it here as 
an application of algebraic operator techniques. Instead of using the more traditional 
tools of differential equations, special functions, and integral representations, these 
methods exploit the commutation relations among operators and operator identities 
shown in Chapter 4. 

The generalized linear harmonic oscillator is a system whose Hamiltonian, ex- 
pressed in terms of two canonical observables p and q, is given by 



A 2 2 

- mco q 
2 



ha) 

T 


(10.67) 


where the Hermitian operators p and q satisfy the canonical commutation relation. 


qp — pq = ihl 


( 10 . 68 ) 


Both p and q have continuous spectra extending from to +°°. 

We first consider the eigenvalue problem of H, because it will give us the 
important stationary states of the system. It is convenient to introduce a new operator 



(10.69) 


which is not Hermitian. By use of the commutation relation, we prove easily that 


H = ftwf a? a + r I — fto>| aa 




(10.70) 


where o' is the Hermitian adjoint of a: 


a 


.t _ 



q - i 


. P 


m(o. 


(10.71) 
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The commutator of a and a T is 



(10.72) 


which shows that a is not even normal. Since by (10.70) H is a linear function of 
a f a, the eigenvectors of H and of a f a are the same, and it is sufficient to solve the 
eigenvalue problem for a f a. Expressing the eigenvalues as A„ (n = 0, 1, 2, . . .) and 
the corresponding eigenvectors by 


we have 


or 


Vn = | n) 

(10.73) 

a f a% = \ n V n 

(10.74) 

a*a\n) — A„ | n) 

(10.75) 


This is the equation that we must solve. 

First, we prove that A„ > 0. From (10.74) we get 

(*„ , a'c£9 n ) = A„ (% , ¥„) or (a% , aV n ) = A„ (¥„ , ) 

Since (9.35) holds for all vectors, we conclude that 


A„ > 0 (10.76) 

If | n) is an eigenvector of a r a, then a f \n) is also an eigenvector, as can be seen 
from the equation 

(a f a)a t |n> = a t (a t a+l)|n) = (A„ + l)a t |n) 

where the commutation relation (10.72) has been used. Hence, a?\n) is an eigen- 
vector of a f a, with eigenvalue A„ + l. Similarly, we can show that a\n) is also an 
eigenvector of a f a with eigenvalue A„ — 1 . These properties justify the designation 
of a T as the raising operator and a as the lowering operator. By applying these 
operators repeatedly, we can generate from any given eigenvector | n) new eigen- 
vectors with different eigenvalues by what is graphically called a ladder method. 
However, condition (10.76) limits the number of times a lowering operator can be 
applied. When by successive downward steps an eigenvalue between 0 and 1 has 
been reached, by applying a again we do not obtain a new eigenvector, because that 
would be an eigenvector whose eigenvalue violates the restriction (10.76). Since we 
have arbitrarily (but conveniently) labeled the lowest step in the ladder by setting 
n = 0, we obtain 


a + a|0) = A 0 1 0), 1 > A 0 > 0 

and 


Consequently, 


a|0> = 0 


(10.77) 


A 0 = 0 

and this is the only eigenvalue below unity. 


(10.78) 
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Starting from |0), we obtain all other eigenvectors and eigenvalues by repeated 
application of the raising operator a t . The eigenvalues increase in unit steps. Hence, 

| «) - N n { a y\Q>) (ft = 0, 1, 2, . . .) (10.79) 

and 

A „ = n (10.80) 

The normalization constant N n must yet be determined. There is no degeneracy as 
long as no dynamical variables other than p and q appear to characterize the system. 
The set of eigenvectors obtained is complete. Combining (10.70), (10.75), and 
(10.80), we obtain 

H\n) = fico^n + -^|n) (10.81) 

Hence, the eigenvalues of the Hamiltonian are 

(10.82) 

in agreement with the discrete energy eigenvalues found in Chapter 5. 

Exercise 10.9. What are the eigenvalues of the kinetic and the potential energy 
operators of the harmonic oscillator? Explain why these don’t add up to the (discrete) 
eigenvalues of the total energy. 

Since its eigenvalues are all the nonnegative integers, the operator N = a f a 
plays a central role when the number of (identical) particles is an observable of a 
system, and it is then called the number operator. The notion of a particle as an 
excitation of an oscillator-like dynamical system (such, as an electromagnetic field 
or an elastically vibrating body) has been at the core of quantum physics from the 
beginning. An excited energy level of the harmonic oscillator with quantum number 
ft is interpreted as corresponding to the presence of n particles or quasiparticles, 
each carrying energy hco. These particles or quanta are named phonons , excitons, 
photons, and so on, depending on the physical context in which the system is rep- 
resented by a harmonic oscillator. The eigenstate 1 0), which must not be confused 
with the null vector 1 0), is variously known as the ground state, the vacuum state, 
or the no-particle state of the system. For more detail about the quantum mechanics 
of identical particles, see Chapter 21. 

The ladder property of the lowering and raising operators a and a f , and the 
orthonormality of the states | ft), leads us to conclude that the matrix elements of a 
and connect only neighboring basis states: 

(ft — l|a|ft) — C n (10.83) 

(n + l|a f |ft) — («|a[ft + 1)* = C* +1 (10.84) 

To evaluate C n we may use the closure relation 

X \ n ')( n '\ = 1 

n ' —0 



(10.85) 
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as follows: 


Thus, 


\C„\ 2 = (n — l\a\n)*(n — l|<z|n) = (n\ a f \n — 1 )(n — l\a\n) 
— 2 (n\a : \n'){n'\a\n) — { n\a t a\n ) = n 


C n = Vn e ia " 


( 10 . 86 ) 


Since there is no other restriction on the matrix elements, a n = 0 for all n is a 
possible and consistent choice for the phase. We may therefore write 

a\n) — \n — 1 ){n — l|a|n) = Vn|« — 1) (10.87a) 


a*\n) — \n + 1)(« + l|a f |n) = Vn + 1 1 n + 1) 
From here it follows that the normalized eigenkets of a ] a are 

= \n) = {n\y v \a') r "V 0 - (n!)' 1 ^)" 1 0) 


(10.87b) 


( 10 . 88 ) 


In the representation spanned by the basis vectors |n), the matrix H is diagonal and 
given by 


1 0 0 • 

* 1 ° 3 0 • 

flOD 

H = ~— 0 0 5- 

2 


(10.89) 


The matrices representing a and are 

/0 VI 0 0 • •' 

[0 0 V2 0 • • 

a = 0 0 0 V3 • • 


(10.90a) 


I 0 0 0 

VI o o 

0 V2 0 

0 0 V3 


(10.90b) 


Exercise 10.10. Using the matrices (10.90a) and (10.90b), verify the com- 
mutation relation aa t — a? a = 1. 


The coordinate q of the oscillator can be expressed as 

q = fjL. (a + a t) 


(10.91) 
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and its matrix in the | n) basis is 


/ o 



VI 

0 

V2 


0 

V2 

0 


(10.92) 


An eigenstate of the coordinate q with eigenvalue q' is represented by a column 
matrix, and the eigenvalue equation for q appears in the form 



/ o vT o 
VT o V 2 
0 \/2 0 

\ . . . 


\M 


h\ 

r 

Ci 


'Cl 

C 2 

= q' 

C 2 

ill 


w 


(10.93) 


where the components of the eigenvector of q are the transformation coefficients 

c„ = (n\q') (10.94) 

Equation (10.93) leads to a set of simultaneous linear equations: 

VI Ci 


2mo) 


h 


q c 0 


V2 c 2 + VT c 0 


2 mo) 


h 


q Ci 


Vn + 1 c n+ i + Vn c„_i = 


2 ma) 


h 


q Cn 


These simultaneous equations are solved by 

c n (q') = 2 -<" l2 Xnir V2 H n q^c 0 (10.95) 

by virtue of the recurrence relation, 

H n + i(x) - 2 xH n (x) + 2 nH n _i{x) = 0 
for Hermite polynomials. 

The closure condition (10.85), represented in the form 


X (q'\n)(n\q") = 8(q' - q") 

n= 0 

finally determines the constant factor in (10.95). The result is 



(10.96) 


in agreement with Eq. (5.39) 
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Exercise 10.11. Verify the recurrence relation for Hermite polynomials from 
the formula (5.35). 


Exercise 10.12. Transcribe Eqs. (10.77) and (10.88) in the coordinate ( q ) 
representation and calculate (q 1 \ n) from these differential relations. Using the math- 
ematical tools of Section 5.3, verify Eq. (10.96). 


7. Coherent and t Squeezed States. The general state of an harmonic oscillator 
can be expressed as a superposition of the energy eigenstates |n). A class of states 
that is of particular importance consists of the eigenstates of the non-Hermitian 
lowering operator a, with eigenvalue a 



(10.97) 


A trivial solution of this equation is the ground state |0) for a — 0, as seen from 
(10.77). 

The unitary shifting or displacement operator 


D a = e aaf ~ a * a 


(10.98) 


causes a shift of the operator a, since from Eq. (3.58) we see that 

DlaD a = e -^^*a ae ^- a * a = a + a (10.99) 

for an arbitrary complex number a. We deduce from (10.98) and (10.99) that D a 
has the properties 

Dl = D~ l = D_ a (10.100) 

and 

aD a \ 0) = aD a | 0} 

This result shows that the solution of the eigenvalue problem (10.97) may be taken 
to be 


|a) = D a |0) (10.101) 

and that all complex numbers are eigenvalues of the operator a. Since D a is unitary, 
the eigenket |a) in (10.101) is normalized, 

<a|a> - <0|0> = 1 (10.102) 

Using (3.61) and (10.77), this eigenket can be expressed as 


a) = e “ at- “* a |0> = <rl“l 2/ V* flt |0> 


(10.103) 


These eigenstates of the lowering (annihilation) operator a are known as coherent 
states. Their relation to the minimum uncertainty wave packets (10.66) will be 
brought out shortly. For some purposes it is instructive to depict the eigenvalue a 
of a coherent state as a vector in the two-dimensional complex plane (see Argand 
diagram, Figure 10.1). It is interesting to note that the coherent states are normalized 
to unity, even though the eigenvalue spectrum of the operator a is continuous. 2 


2 The first comprehensive treatment of coherent states was given by Glauber (1965). 
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Figure 10.1. Two-dimensional phase space or Argand diagram representing a coherent 
state | a) in terms of the eigenvalue a of the lowering or annihilation operator a. Also 
shown is the effect of a “displacement” D p and a “rotation” R x on the coherent state | a). 


Exercise 10.13. Using the property (10.99), show that for any coherent state 

I a), 

a) = C\a + (3) (10.104) 

where | a + (3) is again a coherent state and C is a phase factor. Interpret this result 
in terms of the complex eigenvalue plane (Figure 10.1). 


A second useful unitary operator is 

R K = e iXafa (10.105) 

with a real-valued parameter A. Since [a f a, a] = —a, th'e identity (3.59) gives 


R\ a R K — e 


— i\afa a e *aU = g iA fl 


(10.106) 


Exercise 10.14. Show that for any coherent state | a), 

Rx\a) = C'\e iX a) 

where | e lA a) is again a coherent state and C' is a phase factor. Interpret the meaning 
of this result in the complex eigenvalue plane (Figure 10.1). 

There is an eigenstate |a) of a for any complex number a, but the coherent 
states do not form an orthogonal set. The inner product of two coherent states | a) 
and [ /3) is 

(j3| a) - e~l a ! 2/2- l /* l 2/2 (0 1 e P* a e aa ^ j o) = ^-|“l 2/2 -|' 3 i 2/2 (0|e“ a+ ^* a ^*|0) (10.107) 

= e -|«| 2 /2-|P| 2 /2+a:/3* 


and 

l(/3|a)| 2 = e~> a ~ 


(10.108) 
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Hence, the distance | a — /3 1 in the complex eigenvalue plane (Figure 10.1) measures 
the degree to which the two eigenstates are approximately orthogonal. 

To expand the coherent state | a) in terms of the energy or the number-operator 
eigenstates |n), we calculate 


a) = 2 |«)<«| or) = 2 I n)(n\e M 2/ V*« + ]o) 


=.X \r.)e~^(n\ 2 ^|0> = £ \n)e~^^= 

n~0 k=0 Ki n= 0 V»! 


(10.109) 


In the last step, Eq. (10.88) and the orthonormality of the energy eigenstates have 
been used. The probability P n (a ) of finding the coherent state | a) to have the value 
n when the operator aja is measured is thus given by the Poisson distribution : 


P n (a) = e-H 2 


a 

2 n 

n 

! 


( 10 . 110 ) 


The mean (expectation) value of n for this distribution is |a| 2 . 


Exercise 10.15. Evaluate the integral 

f j </3 1 ct> | 2 d 2 /3 


over the entire complex /3 plane, and interpret the result. How can this be reconciled 
with the probability doctrine of quantum mechanics? (See Section 15.5.) 

Exercise 10.16. By requiring that |a) = 2 |rc)(n|a) is an eigenket of the 

n 

operator a, with eigenvalue a, obtain a recurrence relation for (n\a). Verify 
(10.109). 

As we might expect from the lack of restrictions imposed on the eigenvalues 
and eigenstates of a, the latter form an overcomplete set. An arbitrary state can be 
expanded in terms of them in infinitely many different ways. Even so, an identity 
bearing a remarkable similarity to a closure relation can be proved: 




a) dRe(a ) cftm(a) (a\ = 1 


( 10 . 111 ) 


Here, the integration is extended over the entire a plane with a real element of area. 


Exercise 10.17. Prove Eq. (10.111). This is most easily done by expanding 
the coherent states in terms of the harmonic oscillator eigenstates, using (10.109) 
and plane polar coordinates in the complex a plane. 

Exercise 10.18. Prove that the raising operator has no normalizable eigen- 
vectors, and explain the reason. 


An arbitrary state of a system, which has the coordinate q as its complete set 
of dynamical variables, can be written, on account of (10.88), as 
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where c n - (nl'VP') and F(a f ) is a convergent power series (entire function). Hence 
<a|¥> = {a | F(af) | 0) = F(a*)<a|0) - e~' a f /2 F(a*) (10.112) 

The entire function F(a*) thus represents the state 'P. 

Exercise 10.19. What function F{a*) represents the coherent state |j8>? 

The action of a normally ordered operator on an arbitrary state can be expressed 
conveniently in terms of this representation. An operator is normally ordered if, by 
use of the commutation relations, all lowering operators have been brought to the 
right of all raising operators. For example: 

A — a 2 a + = a( 1 + a?a) — a + (1 + 0^0)0 = 2 a + a^a 2 

shows how normal ordering is achieved. The expectation value {a\A{a^ , a)| a) of a 
normally ordered operator is 

(a\A(a f , a)\a) = A(a*, a) = A((a t ), (a)) (10.113) 

For example, in a coherent state |a) the expectation value of a product of normally 
ordered operators, like ( a t ) n a m , can be factored: 

<a|(a + )V|a> = a* n a m = (a\a*\a) n (a\a\a) m (10.114) 

and written as a product of expectation values of and a. In general, such factor- 
izations are not permissible in quantum mechanics, but coherent states enjoy the 
unusual property of minimizing certain quantum correlations (or quantum fluctua- 
tions). This has led to their designation as quasiclassical or semiclassical states. The 
term coherent reflects their important role in optics and quantum electronics (Section 
23.4). 

Exercise 10.20. For a coherent state | a), evaluate the expectation value of the 
number operator a f a, its square and its variance, using the commutation relation 
(10.72). Check the results by computing the expectation'values of n, n 2 , and (An) 2 
directly from the Poisson distribution (10.110). 

Consider an operator A(a f , a) which is normally ordered, and let it act on an 
arbitrary ket 

A(a\ a)|¥> = A(a\ a)F(a f )\0) = A^a\ ^F(a f ) |°> (10.115) 

where d/daf denotes formal differentiation. This last equation follows from the com- 
mutation relation 


a(fl t r - (a f ) n a = n(a t ) n “ 1 - -7 (a f ) n 

da T 

and the property a|0) = 0. Hence, we infer that the entire function 

A { a *’ 

represents the state A j ^ r ) in the same sense as, according to (10.112) F(a*) repre- 
sents the state [M*'). 
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Exercise 10.21. Rederive the function F(a*) which represents the coherent 
state |/3) by letting A — a and requiring a|/3) = /3|/3). 

Exercise 10.22. Choose A — a f a, the number operator, and obtain the entire 
function F(a*) which represents its eigenkets. Verify that the eigenvalues must be 
nonnegative integers. 


By definition ‘(10.69), the non-Hermitian operators a and a f are related to Her- 
mitian canonical coordinate and momentum operators as 


ft 


(a f + a), p — i 


mho) 


( a f — a) 


" ^ v 2 

The expectation values of q and p in a coherent state | a) are 

I fh I 2 ft 

(a | q I a) — / (a + a*) — / Re a 

y 2 m(o y mao 

and 


(10.116) 


IftlTlO) 

(a: |p | a) = i / — - — (a* — a) = \f2hmoi Im a 

Furthermore, taking advantage of normal ordering, we can calculate 

h 


(10.117) 


(a|^ 2 |o:) = {ot\a z + a n + a f a + l|a) 


2m co 
h 

2 mco 


[(or + a*) 2 + 1] = {a\q\a) 2 + 


h 


2 mto 


and similarly 


(a\p 2 \a) = (ot\p\a) 2 + 


fimai 


The last terms on the right-hand side are proportional to h and exhibit the quantum 
fluctuations in q and p. The variances are 


(A qf = (a |# 2 1 a) - (a|^|o :) 2 = 
(Ap) 2 = (a|p 2 | a) - («|p|a) 2 - 


h 


2ma> 

hmo) 


(10.118) 

(10.119) 


so that 


A^ • Ap = | 


(10.120) 


showing that in the coordinate and momentum language the coherent states are min- 
imum uncertainty (product) states. 

As discussed in Section 9.6, in the coordinate representation the eigenvalue 
condition 
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is transcribed as 


q + ~~ ~ )<^l a > = a (o\a) 

V ma> 


2mo) dq / 

This differential equation has the same form as (10.65) and the solution 


( 10 . 121 ) 



h 


q\/2fim(o Im a 


( 10 . 122 ) 


In this form, known to us from (10.66), coherent states have been familiar since the 
early days of quantum mechanics. 


Exercise 10.23. Compute the normalization factors C' and C in (10.122) and 
show how they are related. 


For a fixed oscillator mode, specified by a given value of mo), the coherent 
states are the manifold of those minimum uncertainty states that have definite values 
for A q and A p, given in Eqs. (10.118) and (10.119). (If mco = 1, the uncertainties 
in q and p are both equal to \/hl 2.) We can construct other minimum uncertainty 
states with narrower width A q, so-called squeezed states, for the same oscillator by 
defining a new set of raising and lowering operators 



(10.123) 

(10.124) 


using an arbitrarily chosen positive parameter o>' . Obviously 


[b, b f ] = 1 (10.125) 

The operators b, b* can be expressed in terms of a and a f by substituting (10.116) 
into (10.123) and (10.124): 


or 



(10.126) 


b = Aa + va r 
b f = + va 


(10.127) 


where A and v are two real numbers that are related by the condition 

A 2 - v 1 = 1 (10.128) 


(with A > 1). 
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Exercise 10.24. Verify (10.128). 


In the language of bosons (Chapter 21), the transformation (10.127), which 
generally does not preserve the number of particles (since tfb + afa), is referred to 
as a quasiparticle transformation. The operator b' creates a quasiparticle, which is 
a superposition of particle states, and b annihilates a quasiparticle. 

If (10.127) is inverted, subject to the restriction (10.128), we have 


a = kb — vb T 
a + — AZ? 1 — vb 


(10.129) 


The eigenstates of the lowering operator b are defined by 


b |/3) = 0|0> (10.130) 

From the relations (10.123) and (10.124) it is apparent that these states are minimum 
uncertainty states for p and q, but the uncertainties of these quantities are determined 
by cd’, and not by a ) : 


so that 


(A qf 

(A pf 


h 

2 moi' 
hma)' 
2 


A 

2moj 

hmco 


(A 

(A 


- vf 
+ vf 


(Sq)\\p? = j 


(10.131) 


as it should be for a minimum uncertainty state. Although co is fixed, the uncertainty 
in either q or p can be controlled in these states by changes in the parameter &>'. 
Since it is possible, for instance, by choosing very large values (o' » co to reduce 
A q arbitrarily at the expense of letting A p grow correspondingly, these states have 
been named squeezed states in quantum optics. 


Exercise 10.25. For a squeezed state | /3) verify the values of t±q and A p given 
in (10.131). 

Exercise 10.26. Prove that the operators a and b are related by a unitary 
transformation 


b = Ualf 


where 

U = g«a 2 -« T2 )/2 (10.132) 

and e £ — \ + v. Show that U transforms a coherent state into a squeezed state. 
[Hint: Use identity (3.54).] 


Problems 

1. Carry out numerical integrations to test the uncertainty relation AxA k x s 1/2 for the 
wave packets defined by Figures 2.1 and 2.2. 
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2. Assuming a particle to be in one of the stationary states of an infinitely high one- 
dimensional box, calculate the uncertainties in position and momentum, and show 
that they agree with the Heisenberg uncertainty relation. Also show that in the limit 
of very large quantum numbers the uncertainty in x equals the root-mean-square 
deviation of the position of a particle moving in the enclosure classically with the 
same energy. 

3. Calculate the value of AxA p for a linear harmonic oscillator in its nth energy eigen- 
state. 

4. Using the uncertainty relation, but not the explicit solutions of the eigenvalue prob- 
lem, show that the expectation value of the energy of a harmonic oscillator can never 
be less than the zero-point energy. 

5. Rederive the one-dimensional minimum uncertainty wave packet by using the vari- 
ational calculus to minimize the expression I — (Ax) 2 (A p) 2 subject to the condition 


J | if/\ 2 dx = 1 

Show that the solution f of this problem satisfies a differential equation which is 
equivalent to the Schrodinger equation for the harmonic oscillator, and calculate the 
minimum value of AxA p. 

6. The Hamiltonian representing an oscillating LC circuit can be expressed as 


Q 2 <f> 2 
H = + — 

2C 2 L 


Establish that Ham i lton’s equations are the correct dynamical equations for this sys- 
tem, and show that the charge Q and the magnetic flux <I> can be regarded as canon- 
ically conjugate variables, q, p (or the dual pair p, —q). Work out the Heisenberg 
relation for the product of the uncertainties in the current 7 and the voltage V. If a 
mesoscopic LC circuit has an effective inductance of L = 1 /u,H and an effective 
capacitance C ~ 1 pF, how low must the temperature of the device be before quantum 
fluctuations become comparable to thermal energies? Are the corresponding cur- 
rent-voltage tolerances in the realm of observability? 

, i 

7. If a coherent state | a) (eigenstate of a) of an oscillator is transformed into a squeezed 
state by the unitary operator 


U = exp 



calculate the value of £ that will reduce the width of the Hermitian observable 
(a + a t )/V 2 to 1 percent of its original coherent-state value. What happens to the 
width of the conjugate observable ( a — a f )/\^2i in this transformation? 



CHAPTER 11 


Angular Momentum. 

in Quantum Mechanics 


We now turn to the motion of a particle in ordinary three-dimensional 
space. Bohr found the key to the theory of electronic motion in the 
Coulomb field of the nucleus in the quantization of angular momentum 
(in units of Planck’s constant divided by 2tt). Beyond its relevance to the 
classification of energy levels in central-force systems, the study of orbital 
angular momentum brings us one step closer to a detailed exposition of 
symmetry in quantum mechanics (Chapter 17). 


1. Orbital Angular Momentum. Central forces are derivable from a potential 
that depends only on the distance r of the moving particle from a fixed point, usually 
the coordinate origin. The Hamiltonian operator is 



V(r) 


2m 


V 2 + V(r) 


( 11 . 1 ) 


Since central forces produce no torque about the origin, the orbital angular 
momentum 


L = r X p 


( 11 . 2 ) 


is conserved. In classical mechanics this is the statement of Kepler’s second law. 

According to the correspondence principle, we must expect angular momentum 
to play an equally essential role in quantum mechanics. The operator that represents 
angular momentum in the coordinate representation is obtained from (11.2) by re- 
placing p by (h/i)V: 


L = 



V 


(11.3) 


No difficulty arises here with operators that fail to commute, because only prod- 
ucts like xp y , yp z appear. 

In view of the great importance of angular momentum as a physical quantity, 
it is well to derive some of the properties of the operator L, using the basic com- 
mutation relations between the components of r and p and the algebraic rules of 
Section 3.4 for commutators. For example 

[L x , y ] = [yp z - zp y , y] = ~z[p y , y] = ihz 
[L x ,p y \ = [yp z - zp y ,Py ] = [y, P y \p z = ihPz (H.4) 

[L z , x ] = 0, [L x , p x ] = 0 
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Similar relations hold for all other commutators between L and r and between L 
and p. From these relations we can further deduce the commutation relations be- 
tween the various components of L : 

[L x , Ly] \L X , zPx %Pz\ \-L x , z]p x %\L X , p z ] 

— —ihyp x + ihxpy — ihL z 

and by cyclic permutation (x — > y — > z — * x) of this result, 


[L x 


Ly] = ihL Z 


[L y 


L z ] = ihL x , 


[L z , L x \ ihLy 


(11.5) 


Since the components of L do not commute, the system cannot in general be assigned 
definite values for all angular momentum components simultaneously. 

Insight into the nature of the angular momentum operator is gained by noting 
its connection with (rigid) rotations. Suppose f(r ) is an arbitrary differentiable func- 
tion in space. If the value f(r) of the function at point r is displaced to the new 
point r + a, where the displacement vector a may itself depend on r, a new function 
F(r) is obtained by the mapping 

/( r) l-> F( r + a) = /( r) 

For an infinitesimal displacement s, 

F(r + s) = F( r) + e • V F( r) = f(r) (11.6) 

and the change of the function f is, to the first order, 

Sf( r) - F(r) - f( r) - -8 • V F( r) = -e • V f(r) (11.7) 

A finite rotation R by an angle about an axis that points in the direction of 
the unit vector n through the origin is characterized by the displacement vector 
(Figure 11.1) 

a« = n X (ft X r)(l - cos fi) + n X r sin (11.8) 


Exercise 11.1. Verify (11.8) and show that it gives the expected answer for a 
rotation about the z axis. 


Exercise 11.2. For an infinitesimal displacement s, applied to the vector func- 
tion /( r) — r, show that Sf(r) = fir = — e. 

The inverse rotation R~ 1 by an angle —(f) about the same axis (or, equivalently 
by an angle about the unit vector — n) is described by the displacement vector 

a*-i = n X (n X r)(l — cos fi) — n X r sin (11.9) 

Because of the r-dependence of the displacement, for finite rotations generally 
a^-i + — a R . The rotation R causes the value of the function / at position 
r + a*-i to be displaced to the original position r; hence, 

Fix) = fir + a*-.) 

Exercise 11.3. If the r dependence of the displacement vector is explicitly 
indicated as a*(r), prove that 


a*(r) + a*-i(r + a*(r)) = 0 
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(a) 



Figure 11.1. Rotation about an axis defined by the unit vector ft and the rotation angle (f>. 
(a) shows the displacement a^(r) of the point whose position vector is r. (b) illustrates the 
active rotation of a function or state /( r) about an axis (ft) perpendicular to the plane of 
the figure: /( r) ^ F(r) = /( r - a*). 

If the rotation angle 8(f> is infinitesimal, (11.8) shows that the displacement £ 
can, to second order in 8cf>, be expressed as 

e = 8(f>n X r + ^ (8(f>) 2 n X (n X r) = 8<J> X r + ^ 8<{> X (8c}> X r) (11.10) 

where Sd|> = 8(f > n is a vector of length 5(f> pointing in the direction of the axis of 
rotation with an orientation defined by a right-handed screw. If the inverse rotation 
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is infinitesimal, the displacement is simply — e. For an infinitesimal rotation, the 
change of the function f is then, to first order in 8fi, 

8f = -8<f>h X r V/ = -8<f> n r X V/ (11.11) 


or 


8f= -^84>-L/ 


( 11 . 12 ) 


The operator L /h is called the generator of infinitesimal rotations. 

Equation (11.12) can be integrated for a finite rotation R about the fixed axis 
n. The result is straightforward: 

F(r) = f(r) (11.13) 

and defines the unitary rotation operator 

U R = (11.14) 


The rotation operator U R rotates a state represented by the wave function if/(r) into 
a new state represented by ift'( r) = U R if/(r). For any operator A, we define a rota- 
tionally transformed operator A' such that A’fif r) = U R Afi( r), which implies that 

A' = U R AUl = e-ww^Ae 0 '^ 1 ' (11.15) 


For infinitesimal rotations this becomes 


8A = A' - A = [A, 8<j> X r • V] = ^ [A, 8<|> • L] (11.16) 

A vector operator A is a set of three component operators whose expectation 
value in the rotated state is obtained by rotating the expectation value of A in 
the original state For an infinitesimal rotation: 

0P'|A|¥') - <¥|A|¥> = 8<J> X <¥|A|¥> (11.17) 

Keeping only terms up to first order in the rotation angle, we find that this relation 
leads for a vector operator A to the condition 

U R AUl - A = SA = — 8<j> X A (11.18) 


The operators r, p, and L = r X p are examples of vector operators. For these, 
substitution of (11.18) into (11.16) yields the commutation relation 


[A, n • L] = ihn X A 


(11.19) 


Exercise 11.4. By letting A = r, p, and L in (11.19), verify (11.4) and (11.5). 
Also check that if A = r in (11.18), the correct expression for 5r is obtained. 

Exercise 11.5. Apply to infinitesimal translations the reasoning that led to 
(11.19) for rotations, and rederive the fundamental commutation relations of r and p. 

A scalar operator S is an operator whose expectation value is invariant under 
rotation and which therefore transforms according to the rule 


8S = 0 


( 11 . 20 ) 
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It follows from (11.16) that for a scalar operator S : 


SL — LS = [S, L] = 0 (11.21) 

The scalar product A • B of two vector operators is the simplest example of a scalar 
operator, since 


5(A • B) = 5A • B + A • SB = -8<J> XAB~A8<J>XB = 0 


We note in particular that the orbital angular momentum L commutes with any 
(potential energy) function V(r) of the scalar r, with the kinetic energy p 2 Hm and 
with L 2 : 


[L, L 2 ] = 0 


( 11 . 22 ) 


Exercise 11.6. Verify that any component of L, say L z , commutes with 
L 2 = L 2 4- L 2 + L 2 by using the co m mutation relations (11.5). 

Exercise 11.7. Prove that r • L and p • L are null operators. 

Exercise 11.8. Does the equation 

L X L = ih L (11.23) 

make sense? 

Exercise 11.9. If two rotations 8<J>! and 8(|>2 are performed, in that sequence, 
to second order, the total displacement is 

a 2 i = 84>i X r + S<J> 2 X r 4- 5<f> 2 X (5<j>! X r) 

+ \ X (S<j>, X r) + i 8<j> 2 X (S<t> 2 X r) 

The displacement a 12 is obtained by interchanging the rotations 1 and 2. Show that 
the difference displacement a 21 — a 12 = (8<f> 2 X 8<}>i) X r is effected by the rotation 
S<j> 12 — 8c}> 2 X 8^! and that this rotation correctly induces a second-order difference 
i 

821 / — &12 f = ~ ” 8<b 2 X Scb ! • Lf owing to the validity of the commutation rela- 
n ‘ 

tions (1 1.23). 


In summary, we conclude that it is not possible, in general, to specify and 
measure more than one component n • L of orbital angular momentum. It is, how- 
ever, possible to specify L 2 simultaneously with any one component of L. The Ham- 
iltonian, H = p 2 l2m + V(r), for a particle moving in a central-force field commutes 
with L, and it is therefore possible to require the energy eigenstates of a rotationally 
invariant system to be also eigenvectors of L 2 and of one component of L, which is 
usually chosen to be L z . Thus, in preparation for solving the energy eigenvalue 
problem, it is useful first to derive the common eigenvectors of L z and L 2 . Just as 
the eigenvalue problem for the harmonic oscillator could be solved by two methods, 
one analytic based on differential equations, the other algebraic and starting from 
the commutation relations, here also we can proceed by two quite different routes. 
We give precedence to the algebraic method, saving the analytic approach for sub- 
sequent sections. 
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2. Algebraic Approach to the Angular Momentum Eigenvalue Problem. We 
start with three Hermitian operators J x , J y , J z , which are assumed to satisfy the same 
commutation relations (11.5) as the three Cartesian components of orbital angular 
momentum: 


X ? jy\ iflj z \Jy 5 '1 7 1 ihJ x , | 'l 7 , Jx\ ifl j y 


(11.24) 


Nothing is different here from (11.5) except the names of the operators. We have 
replaced L by J in order to emphasize that the eigenvalue problem, which will be 
solved in this section by the algebraic method, has the capacity of representing a 
much larger class of physical situations than orbital angular momentum of a single 
particle. 

Let us consider the eigenvalue problem of one of the components of J, say J z . 
We construct the operators 


'x, 

+ 

II 

4 - 

+ 

rs, 

4 " 

fx 

II 

I 

(11.25) 

and 

J 2 - Jl + Jy + J\ 

(11.26) 

Of these three operators only J 2 is Hermitian. The operator J_ 

is the adjoint of J+. 

From the commutation relations (11.24), we infer further commutation relations: 

4*' 1 

+ ^ 

1 

II 

(11.27) 

j j 7 - j z j = fty_ 

(11.28) 

J + J - J_J + = 2 ft J z 

(11.29) 

J 2 J - J J 2 = o 

(11.30) 

We note the useful identity 

J 2 ~ Jj± hJ z = J ± J T 

(11.31) 


Exercise 11.10. Prove (11.31). 


Since according to (11.30), J z commutes with J 2 , it is possible to obtain si- 
multaneous eigenvectors for these two operators. This option will help us to distin- 
guish between the various independent eigenvectors of J z . If we denote the eigen- 
values of J z by mft and those of J 2 by Aft 2 , the eigenvalue problem can be written 
as 


7 z |Am) = mft | Am) (11.32) 

J 2 |Am) = Aft 2 1 Am) (11.33) 

The eigenvalues m and A, belonging to the same eigenvector, satisfy the in- 
equality 

As : m 2 


To prove this inequality, we consider 

j 2 - j\ = j\ + jy = | (J+J- + J-J + ) = ^ (J + J\ + JU + ) 


(11.34) 
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Since an operator of the form AA f has only nonnegative expectation values, we 
conclude that 

(Am|j 2 — j\\Xm) > 0 

from which the inequality (11.34) follows. 

Next we develop again a ladder procedure similar to the method employed in 
Section 10.6 for the harmonic oscillator. If we act on Eq. (11.32) with J+ and /_ 
and apply (11.27)“ and (11.28), we obtain 

J z J+\Xm) = (m + l)hJ+ | Am) (11.35) 

J z jJ\Xm) = (m - l)hjJ\Xm) (11.36).. 

Also, 

J 2 J ± | Am) = A h 2 J ± | Am) 

! 

Hence, if |Am) is an eigenvector of J z and J 2 with eigenvalues mh and 
| Am) is also an eigenket of these same operators but with eigenvalues 
and Xh 2 , respectively. We may therefore write 

7+] Am) = C+(Xm)h\X m+1) 

7_|Am) — C~(Xni)h\X m— 1) 

where C ± (Am) are complex numbers yet to be determined. 

For a given value of A, the inequality A > m 2 limits the magnitude of m. Hence, 
there must be a greatest value of m, Max(m) = j, for any given A. Application of 
the raising operator J+ to the eigenket Xj) should not lead to any new eigenket; 
hence, 

J+ | A/) = 0 

Multiplying on the left by we obtain 

J-J+\Xj) = (J 2 - J\ - hJ z )\Xj) = (A - / - j)h 2 1 Xj) = 0 
from which the relation between j and A follows: 

A =y(7+D (11.39) 

Similarly, there must be a lowest value of m, Min(m) — j', such that 

A = j'(j' ~ 1) (11.40) 

Equations (11.39) and (11.40) are consistent only if 

j 1 = ~j or j' = 7+1 

The second solution is meaningless because it violates the assumption that j is the 
greatest and j' the smallest value of m. Hence j' = —j. 

Since the eigenvalues of J z have both upper and lower bounds, it must be pos- 
sible for a given value of A or j to reach | Xj') — | A, —j) from | Xj) in a sufficient 
number of steps descending the ladder by repeated application of the lowering op- 
erator J-. In each downward step, m decreases by unity; it follows that j — j' — 

2 j must be a nonnegative integer. Hence, j must be either a nonnegative integer or 
a half-integer, i.e., the only possible values for j are 

j ~ 2 ’ 1’ 2 ’ ' ‘ ‘ 


Xh 2 , then 
(m ± 1 )h 

(11.37) 

(11.38) 


(11.41) 
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For a given value of j, the eigenvalues of 7 Z are 

mh = jh. O' - 1 )h, ( j - 2 )h, ..., -(j ~ 1)*. “ 

These are 2 j + 1 in number, and there are thus 2 j + 1 orthogonal eigenvectors for 
every value of j. Since 2 j + 1 can be any positive number, we see that for every 
dimension it is possible to construct a vector space that is closed under the operations 
of the algebra of the three operators J x , J y , J z , which are constrained by the com- 
mutation relations. This is the key to the idea of an irreducible representation of the 
rotation group (see Chapter 17). 

With the aid of the identity (11.31), we can now determine the coefficients C ± 
in (11.37) and (11.38). Note that from (11.37), 

(Am|7_ = (Am + l|C+(Am)fr 

Multiplying this and (11.37), we get 

(Am|7_7+|Am) = |C+(A m)| 2 ft 2 (A m+1 1 A m+1) 

Let us assume that all eigenvectors are normalized to unity. Then, since 

(Am|7_7+|Am) = (Am|j 2 — J\ — hJ z \Am) 

= [ j(j + 1) — m 2 — m]ft 2 (Am[Am) 

we conclude that 

|C+(Am)| 2 = j(j + 1) - m(m + 1) = (j ~ m)(j + m + 1) 

The phases of C+ are not determined and may be chosen arbitrarily. A usual choice 
is to make the phases equal to zero. 

Exercise 11.11. Using the fact that /_ is the adjoint of J + , show that 

C_(A m) = C?(A m— 1) 

We then have 

i 

7+ | Am) = VO' — m)(j + m + 1) ^|A m+1) (11.42) 

and 

7_ | Am) = V(y + m)(j — m+1) h\A m— 1) (11.43) 

Exercise 11.12. By the use of Eqs. (11.42) and (11.43), construct the matrices 
representing J x , J y , and 7 Z in a basis that consists of the common eigenvectors of 7 Z 
and J 2 . 

Since it is impossible to specify two or more components of J simultaneously, 
it is of interest to ask what the physical implications of the noncommutivity of such 
operators as J x and J y are. If the commutation relations (11.24) are applied to the 
Heisenberg uncertainty relation (10.54), we have the inequality 

A7,A7 y >||(7 Z )| (11.44) 

Is there a state for which all components of J can be simultaneously determined, 
such that A7* = A7j, = A7 Z = 0? From (11.44) and similar inequalities, we see that 
this can be the case only if the expectation values of all components of J vanish: 

(J) = 0 
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But (A J x ) 2 = </ 2 ) - { J x ) 2 ; hence, if both A J x and (J x ) vanish, then we must also 
have 

(Jl) = 0 

and similar conditions must hold for the other components. In other words, the 
desired state has a sharp nonfluctuating angular momentum value of zero, or 

J|iA) = 0 (11.45) 

and therefore J 2 |i/r) = 0. The only solution of (11.45) is the state 
1 00) = \j = 0, m ~ 0). For all other states, quantum fluctuations make it impossible 
to specify J x , J y , and J z simultaneously. As a consequence, for all states, except the 
state |0 0), (11.34) is a proper inequality and 

j(j + 1) > m 2 

The component J z can never be as “long” as the vector J! In the vector model of 
angular momentum, in the “old quantum theory,” the states \jm) were visualized 
by circular cones centered on the z axis (Figure 11.2). 

We have thus completed the explicit construction of all the operators J which 
satisfy the commutation relations (11.24). The treatment of the eigenvalue problem 
given here has been a formal one. Only the commutation relations, the Hermitian 
nature of J, and certain implicit assumptions about the existence of eigenvectors 
were utilized, but nothing else. In particular, no explicit use was made of the con- 



Figure 11.2. A cartoon illustrating the angular momentum eigenstat es | (m) for f = 2. The 
semiclassical angular momentum vector “L” has length Vf(€ + \)h = V6&, but its 
z component assumes the possible values “L z ” = 0, ±h, ±2 h. This is visualized by 
supposing that the ‘ ‘L’ ’ vector is stochastically distributed on one of the circular cones 
with uniform probability. The mean values of “ L x '” and “L*” obtained from this model 
[zero and (6 — m 2 )h 2 / 2, respectively] agree with the expectation values of the 
corresponding miantum operators. 
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nection between J and spatial rotations, nor is J necessarily r X p. Our solution of 
the eigenvalue problem thus extends to any three operators that satisfy commutation 
relations like (1 1.24), e.g., the isospin operator in the theory of elementary particles 
and in nuclear physics. We must now return to orbital angular momentum 
L = rXp and analyze its eigenvalue problem in more detail. 


3. The Eigenvalue Problem for L z and L 2 . It is convenient to express the orbital 
angular momentum as a differential operator in terms of spherical polar coordinates 
defined by 


x = r sin 6 cos <p, y — r sin 9 sin cp , z = r cos 6 

The calculations become transparent if we note that the gradient operator can be 
written in terms of the unit vectors of spherical polar coordinates as 


V = 



l d A l d 

+ 6 — 

r sin 6 dcp r 36 


(11.46) 


where (Figure 11.3) 

r — sin 6 cos <px + sin 0 sin <p y + cos Qz. 
i p = —sin cpx + cos <py (11-47) 

A 

0 = cos Q cos cpx + cos 6 sin <p y — sin Qz 


z 



Figure 11.3. Angles used in the addition theorem of spherical harmonics. The angles a 
and (3 are the azimuth and the polar angle of the z' axis in the Cartesian xyz coordinate 
frame. They are also the first two Euler angles specifying the orientation of the Cartesian 
coordinate system x’y'z' with respect to xyz (see Figure 17.1). The third Euler angle y is 
left unspecified here, and the x' and y' axes are not shown. The projections of the z' axis 
and the vector r on the xv nlane are dashed lines. 
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Exercise 11.13. Verify (11 .46), which is done most easily by using the relation 

df = dr ■ V/ 

and considering displacements along the curves on which two of the three spherical 
polar coordinates r, cp, 9 are held fixed. 


From (11.46) it is evident that the three spherical polar components of the 
momentum operator (hli) V, unlike its Cartesian components, do not commute. The 
angular momentum may now be expressed as 



(11.48) 


From the representations (11.48), we obtain 


L 2 = L 2 + L 2 y + L 2 = -h 2 


1 d 2 1 

sin 2 9 dip 2 sin 9 



(11.49) 


Exercise 11.14. Derive (11.49) from (11.48). 

The spherical coordinate representation is particularly advantageous for treating 
the eigenvalue problem of L z : 

L z <3>(<p) = — — = (11.50) 

i dip 

where use has been made of the conclusion of the last section that the eigenvalues 
of any component of angular momentum must have the form mh, with m being 
restricted to integers and half-integers. The solutions of (11.50) are simply 

0>(<p) = e im » (11.51) 

The simultaneous eigenfunctions of L z and L 2 must then be of the form 

Y(6, cp) = <D(y)®(0) (11.52) 

What conditions must we impose on the solutions (11.52) to give us physically 
acceptable wave functions? It is implicit in the fundamental postulates of quantum 
mechanics that the wave function for a particle without spin must have a definite 
value at every point in space. 1 Hence, we demand that the wave function be a single- 
valued function of the particle’s position. In particular, <& must take on the same 
value whether the azimuth of a point is given by <p or cp + 27 r. 


'This and other arguments for the single-valuedness of the wave function are discussed in Merz- 
bacher (1962). The conclusions depend strongly on the topology of the space of the coordinates of the 
system. 
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Applied to (11.51), the condition + 2i r) = restricts us to those so- 
lutions 

<D (<p) = e'" 1 * (11.53) 

for which m = 0, ±1, ±2, . . . , i.e., an integer. 

The half-integral values of m are unacceptable as eigenvalues of a component 
of orbital angular momentum, but we will see that they are admissible as eigenvalues 
of different kinds of angular momentum (spin) and also as eigenvalues of other 
physical quantities that satisfy the angular momentum commutation relations (e.g., 
isospin). 

Equation (11.53) shows explicitly that the eigenvalues of L z are mh. Thus, a 
measurement of L z can yield as its result only the value 0, ±h, +2h, .... Since the 
z axis points in an arbitrarily chosen direction, it must be true that the angular 
momentum about any axis is quantized and can upon measurement reveal only one 
of these discrete values. 

The term magnetic quantum number is frequently used for the integer m because 
of the part this number plays in describing the effect of a uniform magnetic field B 
on a charged particle moving in a central field. 


Exercise 11.15. Use the Cartesian representation 



to show that (* ± iy) m is an eigenfunction of L z . 


With (11.49), the eigenvalue problem (11.33) for L 2 now can be formulated 
explicitly as follows: 


L 2 Y(0, <p) = ~h 2 


1 d 2 


|_sin 2 6 dcp 2 sin 


+ 72 sin 6 7Z Y(6 ' & = n2xm & < 1L54) 

in 6 dd dd_ 


4 

We require the functions Y(d, cp) to be eigenfunctions of L z as well. When we sub- 
stitute from (11.52) and (11.51), we get the differential equation 

J2 


1 


sin 6 dd 

By a change of variables 


- . . „ d® \ 

— sin 0 — — — 


m? 


dd 


sin 2 d 


0 + A® = 0 


| = cos d, F($) = 0(0) 


(11.55) is transformed into 

d_ 

dS 




m 


1 - £ 


F + \F = 0 


(11.55) 


(11.56) 


(11.57) 


For the particular case m — 0, (11.57) assumes an especially simple form, familiar 
in many problems of mathematical physics, 


d_ 

d£ 




+ AF = 0 


(11.58) 


and known as Legendre’s differential equation. Its examination follows a conven- 
tional pattern. 
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Equation (11.58) does not change its form when — £is substituted for £. Hence, 
we need to look only for solutions of (11.58) which are even or odd functions of £. 
Since £ — g implies 6 — > tt — 9 and z — » — z, these functions are symmetric or 

antisymmetric with respect to the xy plane. 

The solution of (11.58) that is regular at £ = 0 can be expanded in a power 
series, 




oo 

m = S a k e 


k = 0 


Substitution into (11.58) yields the recursion relation 

(k + 1 )(k + l)a k+2 + [A - k(k + 1)K = 0 (11.59) 


Equation (11.59) shows that in the even case ( a 1 = 0) all even coefficients are 
proportional to a 0 , and in the odd case ( a 0 — 0) all odd coefficients are proportional 
to flj. As an eigenvalue of a positive operator, A must be a nonnegative number. If 
the series does not terminate at some finite value of k, the ratio a k+2 /a k — > k/(k + 2) 
as k —> oo. The series thus behaves like 2 (1 /k)g k for even or odd k, implying that 
it diverges logarithmically for £ — ±1, that is, for 9 = 0 and tt. For the same reason, 
we exclude the second linearly independent solution of (1 1.58). 2 Such singular func- 
tions, although solutions of the differential equation for almost all values of £, are 
not acceptable eigenfunctions of L 2 . 

We conclude that the power series must terminate at some finite value of k = €, 
where € is a nonnegative integer, and that all higher powers vanish. According to 
(11.59), this will happen if A has the value 


A = €(€ + 1) 


(11.60) 


We have thus rederived the law for the eigenvalues of L 2 , in agreement with the 
results of Section 11.2. The orbital angular momentum quantum number € assumes 
the values 0, 1, 2, 3, ... , and the measured values of L 2 can only be 0, 2 h 2 , 6h 2 , 
12 h 2 , .... It is customary to designate the corresponding angular momentum states 
by the symbols S, P, D, F, . . . , which are familiar in atomic spectroscopy. If there 
are several particles in a central field, lower case letters s, p, d, . . . will be used to 
identify the angular momentum state of each particle, and capital letters S, P, D, . . . 
will be reserved for the total orbital angular momentum. 3 

The conventional form of the polynomial solutions of (11.58) is 

1 d e 

p <® = vTTid? ie - iy (1L61) 


These are called Legendre polynomials. The coefficient of £* in the expansion of 
(11.61) is, for € + k = even, easily seen to be 


a k 


(-l) (f -*> /2 (l + k)(€ + k - 1) ... (k + 2 )(k + 1) 

V ■ V. 


+ A), 


(11.62) 


where the last factor is a binomial coefficient. For € + k = odd, a k = 0. We verify 
readily that (11.59) is satisfied by the coefficients a k , and hence that P^($) indeed 


2 For a rigorous treatment, see Bradbury (1984), p. 473. 

3 See Haken and Wolf (1993), p. 171. 
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solves (11.58). The peculiar constant factor in (11.61) has been adopted because it 
gives 

P t (± 1) = (±1) € (11.63) 

The first few Legendre polynomials are 4 

Po(0 = 1 PM) = k5? - 3£) 

PM) = £ PM) = i(35r - m 2 + 3) (11.64) 

P2(i) = m 2 - 1) p 5 ( a = K6 3r - ioe + m 

Since P € (cos 9) is an eigenfunction of the Hermitian operator L 2 , it is clear from 

the general theorems of Chapter 10 that the Legendre polynomials must be orthog- 
onal. Only the integration over the polar angle 9 concerns us here — not the entire 
volume integral — and we expect that 


/: 


P f (cos 9)P e fcos 9) sin 9 dO — 0 if V A t 


(11.65) 


No complex conjugation is needed because the Legendre polynomials are real func- 
tions. The orthogonality relation 

+1 PM)PA0M= 0 if€' #€ (11.66) 


J-i 


can also be proved directly, using the definition (11.61) and successive integrations 
by parts. The normalization of these orthogonal polynomials can also be obtained 
easily by €-fold integration by parts: 

2 


p MW # = (Ai) 


2 e 

= i-iy 
= (-d € 


d e 

d ? 


(e - iy 


dp 

d? 


(2 f 


1 f +1 

• €!) 2 J-i 


' d 2e 

de e 


(A - i y 


(2 €)! f +1 

• V.) 2 J-i 


( 2 * 


{.e - d € d€ = 


(r - D € J d£ 

(A - l y d£ 

2 


(11.67) 


2 € + 1 


Exercise 11.16. Prove the orthogonality relation (11.66) directly, using the 
definition (11.61). 


As usual in the study of special functions, it is helpful to introduce a generating 
function for Legendre polynomials. Such a generating function is 

oo 

(1 - 2 & + s 2 y 1/2 - 2 PM)s n (M < 1) (11.68) 

n = 0 

To prove the identity of the coefficients P n (if) in (11.68) with the Legendre poly- 
nomials defined by (1 1.61), we derive a simple recurrence formula by differentiating 
(11.68) with respect to s: 

co 

(f - s)( 1 - 2 is + s 2 )^' 2 - 2 nPM)s n ~ l (11.69) 

«=o 


4 For pictorial representations of Legendre polynomials and other orbital angular momentum 
eigenfunctions, see Brandt and Dahmen (1985), Section 9.2. 
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or, by the use of (11.68), 

co co 

(i - S) E Pn(t)s n = (1 - 2 is + s 2 ) X nPni&s*- 1 

n = 0 n ~ 0 

Equating the coefficients of each power of s, we obtain 

(n + 1 )P n+ M) = (2 n + 1 )&>„(& - nP n _M) (11.70) 

By substituting s* = 0 in (11.68) and (11.69), we see that 

P 0 (£) = 1, PM) = £ 

in agreement with (11.64). The equivalence of the two definitions of the Legendre, 
polynomials is completed by the demonstration that P„(£) as defined by (11.61) 
satisfies the recurrence formula (11.70). 

Exercise 11.17. Prove the recurrence relation (11.70) for P n { £) defined in 
(11.61). 


Having solved (11.58), it is not difficult to obtain the physically acceptable 
solutions of (11.57) with m =£ 0. If Legendre’s equation (11.58) is differentiated m 
times and if the associated Legendre functions 


p?(& = (i - er 2 


d m PM) 

di m 


1 

2 e ■ V. 


(i - er 12 


d (+m 

d? +m 


(e - if 


(11.71) 


are defined for positive integers m < €, we deduce that 


d_ 

d£ 


(i - a 


dP?(€) 

d£ 


i - e 


p?(& + m + i mo = o 


(11.72) 


which is identical with (11.57) for A — €(€ + 1). The associated Legendre functions 
with m < € are the only nonsingular and physically acceptable solutions of (11.57). 
These functions are also called associated Legendre functions of the first kind to 
distinguish them from the second kind, which is the singular variety. 


Exercise 11.18. Use the inequality (11 .34) to verify that the magnetic quantum 
number cannot exceed the orbital angular momentum quantum number. 


The associated Legendre functions are orthogonal in the sense that 


* P7(Z)P?’(£) d{=0 if € + V 

J-i 


(11.73) 


Note that in this relation the two superscripts m are the same. Legendre functions 
with different values of m are generally not orthogonal. For purposes of normaliza- 
tion, we note that 



[p?m 2 df - 


2 (l + m)\ 
2t + 1 (€ - m)\ 


(11.74) 


We leave the proof to the interested reader. When f is changed to — P"'(^) merely 

retains or changes its sign, depending on whether € + m is an even or odd integer. 
It is natural to supplement the definition (1 1.71) by defining the associated Legendre 
functions for m — 0 as 


p%0 = PM) 


(11.75) 
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Returning now to (11.52) and (11.54), we see that the solutions of (11.54), which 
are separable in spherical polar coordinates, are products of e inup and P™( cos 0). 
Since (11.55) is unchanged if m is replaced by —m, and since P™ is the only ad- 
missible solution of this equation, it follows that the same associated Legendre func- 
tion must be used for a given absolute value of m. 


Exercise 11.19. 
Legendre functions: 


p?(0 = 


Justify the following alternative definition of associated 


(-1 r « + m)\ _ 2 /2 

2 e •€!(€- m)\ K J d?~ m 



(11.76) 


The first few associated Legendre functions are 

p\(t) = vi - e, pk& = 3^vi - e, pko = 3d - e) (11.77) 


4. Spherical Harmonics. It is convenient to define the spherical harmonics 
Y™(9, (p) as the separable solutions (11.52) that are normalized with respect to an 
integration over the entire solid angle. For m > 0, 


*7(0, <p) = 


/2l + 1 (l - m)\ 

47 r (f + m)l 


(— l) m e imcp Pf(cos 0) 


(11.78) 


Spherical harmonics with negative superscripts (subject to the restriction 
— i < m < f) will be defined by 

17(0, ip) = (-ir[!7 m (0, <P)]* (11.79) 

The spherical harmonics are normalized simultaneous eigenfunctions of L z and L 2 
such that 


h r)Y m 

= _ __J_ = mhY m 

i dip 

l 2 r? = €(£ + i)ft 2 i7 

The first few spherical harmonics are listed below: 


* 

° V477 



Y 


±i 

2 



e ±lip cos 0 sin 0 = 


fl5 (x ± iy)z 
■y 87 t r 2 




(x ± iy f 


(11.80) 

(11.81) 


(11.82) 


e ±2lip sin 2 0 = 




4 Spherical Harmonics 
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Under a coordinate reflection, or inversion, through the origin, which is realized 
by the transformation <p — » cp + tt and 9 — > tt — 6, the azimuthal wave function 
e ,nup is multiplied by (— l) m , andP™(cos 0) by (— l) (+m . Hence, Y'"(6, cp) is multiplied 
by (~1) € , when r is changed to — r. 

The spherical harmonics are thus eigenfunctions of the parity operator U p which 
changes r into — r: 


Up*Kr) = iK-r ) 


We have 


(11.83) 


U P Y?(9, cp) = (-1 YY?(9, cp) (11.84) 

i.e., Ff has definite parity in consonance with the parity (evenness or oddness) 
of the angular momentum quantum number €. This result is compatible with the 
reflection properties of orbital angular momentum. The operator L = r X p is in- 
variant under reflection of all three Cartesian coordinates; it is an antisymmetric 
tensor of rank 2, or an axial vector, since both r and p change sign under reflection. 
Hence, 


[Up, L] = 0 (11.85) 

and it follows that all eigenfunctions of L z and L 2 must have definite parity. Since 
F“ is obtained from Y^ e by repeated application of the raising operator 
L+ = L x + iL y and since U P commutes with L+, all orbital angular momentum 
eigenfunctions with the same value of f must have the same parity. 

The spherical harmonics form an orthonormal set, since 



[17(0, cp)]*Y™'(6, cp) sin 6 dd dcp = 


^e,t' ^m,m' 


( 11 . 86 ) 


Although no detailed proof will be given here, it is important to note that the 
spherical harmonics do form a complete set for the expansion of wave functions. 
Roughly, this can be seen from the following facts. 

(a) The eigenfunctions e ,m<p of L z are complete in the sense of Fourier series in 
the range 0 < cp < 2 tt. Hence, a very large class of functions of cp can be expanded 
in terms of them. 

(b) The Legendre polynomials P 0 (£)> Pi(|), ^ 2 (£)> • • • » are the orthogonal 
polynomials that are obtained by applying the orthogonalization procedure described 
in Section 4.1 to the sequence of monomials 1, £ £ 2 , . . . , requiring that there be a 
polynomial of every degree and that they be orthogonal in the interval — 1 :£ £ < + 1 . 
Hence, g k can be expressed in terms of Legendre polynomials, and any function that 
can be expanded in a uniformly converging power series of £ can also be expanded 
in terms of Legendre polynomials. The same is true, though less obviously, for the 
associated Legendre functions of fixed order m, which also form a complete set as 
€ varies from 0 to °°. 

Hence, any wave function that depends on the angles 6 and cp can be expanded 
in the mean in terms of spherical harmonics (see Section 4.2). 
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Exercise 11.20. Construct P 2 (l) by the orthogonalization procedure described 
above. 


Some of the most frequently used expansions of angular functions in terms of 
spherical harmonics may be quoted without proof. 


cos 6 Y™(6, cp) 


/(€ + m + l)(l - m + 1) 
(2€ + 1)(2€ + 3) 


e+i 


+ 


/ (€ + m){€ — m) 

( 2 € + 1 )( 2 € - 1 ) 


(11.87) 


Y7- 


€-1 


sin 0 e itp Y?(6, <p) = - 


j(£ + m + 1)(£ + m + 2) , 


+ 


(2€ + 1)(2€ + 3) 

/(€ — m)(€ — m — 1) 

(2€ + 1)(2€ - 1) 


€+ 1 


( 11 . 88 ) 


F m+i 


€-1 


sin 0 Y*(0, <p) = 


- m + 1)(€ — m + 2) 
(2€ + 1)(2€ + 3) 


r m— 1 
€+1 


/(€ + m)(€ + m - 1) t 


(11.89) 


(2€ + 1)(2€ - 1) 


F m— l 
€-1 


The effect of the operators L x and L y on Y'7 is conveniently studied by employing 
the operators 


P+ E X 3“ lL/yj 


L 7 y /(,- 


which, according to (11.48) may be written as 

( d d ^ 

L + = he^i h i cot 6 — 

\d0 dcp; 


L_ — —he 


“pi 


.80 


i cot 0 


dcp / 


(11.90) 


(11.91) 


(11.92) 


The effect of d/dcp on F” is known from (11.80). To determine dY/dO we note that 
from the definitions (11.71) and (11.76) 


dP? 


1 


pm+l 


m £ pm _ + »*)(€ 

>-2 “f 


m + 1) 
1 


+ 




pm 

■2 ** 


# vnrp * * i - £ 2 ' € vr^p ■ * i - p 

With £ = cos 6 and the definition (11.78), it is then easy to derive the relations 

L + Y7(d, <p) = hV(t - m)(€ + m + 1)F? + 1 (d, cp) (11.93) 

L_F?(0, cp) = hV(Z + m)(€ - m + 1 )Y7~\d, cp) (11.94) 


These equations do not come as a surprise. The operators L + and L_ are raising and 
lowering operators for the magnetic quantum number, and Eqs. (11.93) and (11.94) 
are merely realizations of (11.42) and (11.43), which were derived directly from the 
commutation relations. The complete agreement between these two sets of equations 
shows that the choice of phase factor made for the spherical harmonics is consistent 
with the choice of phases for C+(Am) in Section 11.2. 

The addition theorem for spherical harmonics is a useful application. Consider 
two coordinate systems xyz and x'y'z'. The addition theorem is the formula express- 
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ing the eigenfunction P e ( cos 9') of angular momentum about the z' axis in terms of 
the eigenfunctions Y™(9, cp) of L z . Figure 11.3 indicates the various angles. The 
position vector r has angular coordinates 9, <p, and 9', cp' in the two coordinate 
systems. The direction of the z' axis in space is specified by its polar angle /3 and 
its azimuth a with respect to the unprimed system. Since P ( is an eigenfunction of 
L 2 , only spherical harmonics with the same subscript £ can appear in the expansion. 
An interchange of 9, cp, and (3, a is equivalent to the transformation 9' — > - 9 ' and 
must leave the expansion unchanged, because P e ( cos 9') is an even function of 9'. 
Hence, P e { cos 9') can also be expanded in terms of Y™((3, a). In a rigid rotation of 
the figure about the z axis, a and <p change by equal amounts, and 9' remains con- 
stant. Hence, P ( (c os 9') must be a function of cp ~ a. All these requirements can 
be satisfied only if 

P € (cos 9') = 2 c m YJ m (fi, a)Y>?(9, cp) (11.95) 

m= —t 

The coefficients c m can be determined by using the condition 

L z ,P t { cos 9') = 0 (11.96) 


Since 


L z ’ = sin (3 cos a L x + sin /3 sin a L y + cos (3 L z 
— - sin /3 e~ lol L+ + ^ sin (3 e ,a L- + cos /3 L z 


(11.97) 


Eqs. (11.87), (11.88), (11.89), (11.93), and (11.94) may be used to evaluate 
L z 'P e ( cos 9'). If the linear independence of the spherical harmonics is invoked, we 
obtain, after some calculation, the simple result 

Cm ± 1 

Thus, c m = (~l) m c 0 , and only c 0 need be determined. For this purpose we specialize 
to (3 = 0, or 9 — 9' . Since, from the definitions of Y ™ and Pf, 


Y?( 0, cp) = 


2 £ + 1 


47T 


J m0 


(11.98) 


and 


Y° ( (9, cp) 


2 £ + 1 


477 


P e ( cos 9) 


it follows that 


c o 


477 


2£ + 1 

With (11.79), this proves the addition theorem in the form 


F f (cos 6') — 


477 


2£ + 1 m tL t 


2 Yf(J3, a)YT(9, cp) 


(11.99) 


( 11 . 100 ) 
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The completeness of the spherical harmonics as basis functions for any function 
of the angles 0, cp, or of the direction of the vector r, is expressed by the closure 
relation: 

co € 

2 2 [Y?((3, a)]*Y?(6, cp) = 8 ( ?, ?') (11.101) 

( = 0 m=-t 

The solid-angle delta function on the right-hand side is equal to zero unless the two 
vectors r (0, cp) and ? '(ft, a) coincide. It has the property 

| /(f ')S(f, *') dSV = /(f) (11.102) 

for any function f(r) of the spatial direction specified by 6, cp. 5 If (11.101) is com- 
bined with the addition theorem (11.100), the identity 

(11.103) 

is obtained. 

The delta function in three dimensions has a representation in spherical polar 
coordinates, 

<5(r - r') = 8(r ~ r - S(f, r') (11.104) 

r 

Sence, we infer the further identity: 

(11.105) 

This formula will be useful in the theory of scattering from a spherical potential. 

Exercise 11.21. Check (11.105) by integrating both sides over all of 3-space. 

5. Angular Momentum and Kinetic Energy. Since the kinetic energy is repre- 
;ented in the coordinate (or momentum) representations by an operator proportional 
o V 2 , it is expedient to relate L 2 to the Laplacian. We make the calculation using 
he concise notation that takes advantage of the summation convention (summing 
‘rom 1 to 3 over repeated indices) and of the Levi-Civita asymmetric (third-rank 
ensor) symbol s ijk . The Levi-Civita symbol is defined as follows: 

1 ijk = 123, 231, 312 (even permutation of 123) 

^ijk = ■ — 1 ijk = 321, 213, 132 (odd permutation of 123) 

0 when two or more indices are equal 

Jsing the summation convention, we formulate a simple identity, 

SijkSnqk ^in^jq ^iq^jn (11.106) 

5 The delta function 8(r, r ') is sometimes written as 5(0 — O'), but this is misleading, because 
t implies that O is a variable of integration. There is no such solid angle variable. 
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Exercise 11.22. Prove the equality (11.106). 

It is evident that the components of orbital angular momentum can be written 
in the form 


l^k ifi&ijk %i - 


dXj 


(11.107) 


It follows that 


L LkLfc fl &ljk^nqk X i ^ ( &in ^jq &iq^jil)Xi X n 

\ d 


— —h\ X, X; X; Xj 

\ dXj dXj dXj J dx t/ 


= -ft? 


2 / 5xA d d 2 

X, X t Xj 


d d 

X 1- XiXi 

dx t dXjdXj \dXjJ dXi 


dXidXj J 


- -ft 2 r 2 V 2 - 2r • V - X'Xj 


d 2 \ ~( 2 2 d 2 d 2 

— — = -h \ r 2 V 2 -2 r r 2 — y 

dx t dXj J \ dr dr 2 / 


and we arrive at the important identity 


L 2 

. . d 1 

( 9 d\ 


— r 2 V 2 H 


h 2 

dr 

l dr) 


(11.108) 


Exercise 11.23. Line by line, work through the steps leading to Eq. (11.108). 

An alternative, representation-independent, method for deriving (11.108) starts 
with the operator identity, 

L 2 - (r X p) • (r X p) = — (r X p) • (p X r) 

= -r • [p X (p X r)] = -r • [p(p • r) - p 2 r] 

Since 


[r, p 2 ~\ — 2ifi\t 
and 


p r — r • p = —3 ihl 

we obtain 

L 2 = r 2 p 2 + ihr ■ p — r • p r • p (11.109) 

The component of the gradient V/ in the direction of r is df/dr\ hence, in the (spher- 
ical) coordinate representation, 


h 

r p = t 

i 


d 

r — 
dr 
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nd consequently 


L“ = r 2 p 2 + h 2 r V h 2 r 


— r 2 p 2 + 2 h 2 r — + h 2 r 2 


dr 

d 


dr dr 


dr 


dr z (11.110) 


— r 2 p 2 + — | r 


2 _ r 2 _ 
dr I drj 


i agreement with (11.108). 

Since L and therefore also L 2 commutes with any function of r, the kinetic 
nergy operator is related to angular momentum by 


T — P 2 — 

L 2 

h 2 d | 


2m 

2 mr 2 

2 mr 2 dr 

\ dr) 


( 11 . 111 ) 


lere we see explicitly that L commutes with T, since it is patently irrelevant whether 
ifferentiation with respect to r is performed before or after a rotation about the 
rigin. 

In order to establish the connection between the eigenfunctions of L 2 and the 
olutions of Laplace’s equation, we consider the eigenvalue problem for the last 
irm in (11.108): 


d 2 df(r) 

— r 

dr dr 


= cm 


( 11 . 112 ) 


’his has the solution 


nd the eigenvalue 


f{r) = A/ + Br-*- 1 


c = m + i) 


(11.113) 

(11.114) 


ince L 2 acts only on the variables 6 and cp, and not on r, we see from (11.108) and 
11.81), and by choosing the solution (11.113) which is not singular at the origin, 
lat 


V 2 r e Y?(d, <p) = 0 (11.115) 

'hus, the functions r e Y’"(6, <p) are regular solutions of Laplace’s equation. From the 
efinition (11.78) of spherical harmonics and inspection of the formula (11.71) for 
ssociated Legendre functions it follows that the functions r e Y™(6, <p), when con- 
erted into functions of x, y, z, are homogeneous multinomials of degree €. With 
onnegative integer powers r, s, t, these functions can be expressed as 2 a rst x r y s z l , 
abject to the constraint r + s + t = £ and the requirement that they must be 
□lutions of Laplace’s equation, which accounts for their designation as harmonic 
motions. 

Exercise 11.24. Show that the homogeneous multinomial 

F = ^ a rst X r f z‘ 

r+s+t=f 

as (€ + 1)(€ + 2)12 coefficients and that the linear relations between them imposed 
y the requirement V 2 F = 0 leave 2i + 1 coefficients to be chosen arbitrarily, so 
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that the number of linearly independent harmonic multinomials of degree € equals 
the number of orthogonal spherical harmonics of order 

Aided by the formula connecting the kinetic energy with angular momentum, 
we are now prepared to tackle the central-force problem in quantum mechanics. 


Problems , 

1. For the state represented by the wave function 

if/ = Ne~ ar2 (x + y)z 

(a) Determine the normalization constant A as a function of the parameter a. 

(b) Calculate the expectation values of L and L 2 . 

(c) Calculate the variances of these quantities. 

2. For a finite rotation by an angle a about the z axis, apply the rotation operator U R to 
the function /( r) — ax + by, and show that it transforms correctly. 

3. Explicitly work out the J matrices for j = 1/2, 1, and 3/2. 

4. Classically, we have for central forces 

- 2 L 2 


H = „ 2 

2m 2 mr 2 


+ V(r) 


where p r = (l/r)(r • p). Show that for translation into quantum mechanics we must 
write 


1 


Pr = 


- (r • p) + (p • r) - 
r r 


and that this gives the correct Schrodinger equation with the Hermitian operator 

h f d l\ 

P ' = J[fr + rj 

whereas ( h/i)(d/dr ) is not Hermitian. 

5. Show that in D-dimensional Euclidean space the result of Problem 4 generalizes to 


1 


Pr = 


-(r 

r 


P) + (P 


r)- 

r 


hid D - 1 


" 7 U + 


2 r 
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Spherically Symmetric Potentials 


If the potential energy is rotationally invariant, and thus dependent only 
on the distance r from a center of force, chosen as the coordinate origin, 
orbital angular momentum is conserved. This constant of the motion 
enables us to reduce the three-dimensional Schrodinger equation to an 
ordinary differential equation, the radial equation, analogous to the 
reduction of a central-force problem in classical mechanics to a dynamical 
problem for the radial coordinate r alone, provided that angular 
momentum conservation is used and the inertial centrifugal force 
introduced. As examples of central potentials, we solve the radial 
Schrodinger equation for the trivial case of a free particle (V = 0), the 
spherical square well, and the attractive Coulomb potential (the 
one-electron atom). 


1. Reduction of the Central-Force Problem. Since the Hamiltonian 


P 

2m 


V(r) 


( 12 . 1 ) 


for a particle of mass m moving in a central-force field commutes with the orbital 
angular momentum operator, 


[H, rXp]-[ff,L] = 0 (12.2) 

angular momentum is a constant of the motion for a particle moving in a rotationally 
invariant potential. The operators H, L z and L 2 all commute with each other, 

[H, L z \ = [H, L 2 ] - \L Z , L 2 ] - 0 

in this case, and we can therefore require the energy eigenfunctions also to be 
eigenfunctions of L z and L 2 . These eigenfunctions must then be of the separable 
form 


<A(r) - R E (r)Y?(d, <p) (12.3) 

when spherical polar coordinates are used. The equation that is satisfied by the radial 
factor R(r) is found if we express the Hamiltonian in terms of orbital angular mo- 
mentum. Since L and therefore also L 2 commutes with any function of r, we may 
use (11.111) to write the Schrodinger equation 

Hifj s [T + V(r)\ip = Eilr 


for central forces in the form 


fl 2 d 

(r 2 d \ 

2 mr 2 dr 

l dr) 


+ 


L 2 h 2 
2 mr 2 


+ V(r) 


if/ = Eif/ 


(12.4) 
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If the separated form (12.3) is substituted, this equation can be reduced to the or- 
dinary differential equation for the radial eigenfunction R E (r)\ 


“a | 

M 

1 

I 

( r 2 l) 

2 mr 2 dr 

l dr) 


+ 


1(1 + 1 )h 2 

2 mr 2 


+ nr) 


R E (r) = ER E (r) 


(12.5) 


which is easier to solve than the original partial differential equation. Our procedure 
is entirely equivalent to the familiar separation of variables of the Laplacian oper- 
ator in spherical polar coordinates, but we emphasize the physical meaning of the 
method. 


Exercise 12.1. If you have never done it before, carry through the explicit 
calculation of V 2 in terms of spherical polar coordinates and derive (12.4) by com- 
parison with (11.49). (See Appendix, Section 3, for a general formula for the 
Laplacian in curvilinear coordinates.) 


It is sometimes convenient to introduce yet another radial wave function by the 
substitution 


u(r) = r R(r) 


From (12.6) we find that u(r) obeys the radial equation 


h 2 d 2 u 

\* 2 W + l) i wrtl 

u — Eu 

2m dr 2 

2mr 2 V) _ 


( 12 . 6 ) 


(12.7) 


This equation is identical in form with the one-dimensional Schrodinger equation 
except for the addition of the term h 2 £(£ + l)/2 mr 2 to the potential energy. This 
term is sometimes called the centrifugal potential, since it represents the potential 
whose negative gradient is the centrifugal force. 

Although (12.7) is similar to the one-dimensional Schrodinger equation, the 
boundary conditions to be imposed on the solutions are quite different, since r is 
never negative. For instance, if if/ is to be finite everywhere, w(r) must vanish at 
r — 0, according to the definition (12.6). A detailed discussion of these questions 
requires specific assumptions about the shape of the potential energy, and in this 
chapter the radial Schrodinger equation will be solved for several particular cases. 

2. The Free Particle as a Central-Force Problem. In Section 4.4 the Schrodinger 
equation for a free particle (V — 0), with energy E(> 0), 

~ V 2 *A - Eilr (12.8) 

2m 

was treated quite naturally by the method of separation of variables using Cartesian 
coordinates, since these coordinates are particularly well suited for describing trans- 
lations in Euclidean space. Nevertheless, it is useful also to look at the free particle 
problem as a special case of a potential that depends only on the radial coordinate 
r. The energy eigensolutions of Eq. (12.8) can then be assumed to be separable in 
spherical polar coordinates in the form (12.3). For V = 0, the function R(r) (omitting 
the subscript E for brevity) must satisfy the radial equation 

h 2 d / 2 €(€ + l)h 2 

Imr 2 dr \ dr) 2mr 2 


R(r) = ER(r) 


(12.9) 
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or, since, hk = V2 m E, 


1 d ( -dR\ 
r 2 dr V dr ) 


+ m±n R = k * R 


( 12 . 10 ) 


If we scale the radial coordinate by introducing the dimensionless variable 


P = 



( 12 . 11 ) 


the radial equation reduces to 


d 2 R ] 2dR 
dp 2 p dp 


1 - 


i(i + i) 



( 12 . 12 ) 


This differential equation is seen to be related to Bessel’s equation if we make the 
transformation 


R = 


M 

Vp 


(12.13) 


and obtain 


d 2 J | 1 dJ 
dp 2 p dp 



(€ + 1 / 2) 2 


J — 0 


(12.14) 


The regular solutions of this equation, which do not have a singularity at p = 0, 
are the Bessel functions J e +i/ 2 (p)- By (12.13) they are related to the regular solutions 
of Eq. (12.12), which are defined as 


je(z ) = 


1 2 Z Je+m(z) 


(12.15) 


and known as spherical Bessel functions. That the latter satisfy Eq. (12.12) is easily 
verified if their integral representation 

uz) = C 1,10(1 “ l2)< * (12 ' 16) 

is used. (The variable z, rather than p, is used in the last equations to emphasize that 
these formulas are valid for all complex values of the variable). The first term in 
the series expansion of (12.16) in powers of z is 

m - WTW z ‘ + 0fe<+2> (12J7) 

We note that the spherical Bessel functions are even or odd functions of their ar- 
gument, depending on the parity of €. 

We thus see that the spherical Bessel function jfkr) is the regular solution of 
the radial equation (12.12) and that the radial eigenfunction of the Schrodinger equa- 
tion (12.10) for the free particle is 


R(r) = Cjfkr) 


(12.18) 
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A useful formula linking Bessel functions with Legendre polynomials is obtained 
by integrating (12.16) by parts i times and using the definition (11.61). This leads 
to 


je(z) 


1 f +1 

^ 2 ? J-. 


P e (s) ds 


(12.19) 


The asymptotic form of the spherical Bessel functions can be derived from this 
expression by further integration by parts, and the leading term is 


je(p) = 


CQS[p - (l + 1) 77-/2] 
p 


(p » i) 


( 12 . 20 ) 


All other solutions of (12.12) are singular at the origin and not admissible as energy 
eigenfunctions for the free particle. 

Exercise 12.2. Verify that the asymptotic expression (12.20) for j e (p) satisfies 
the differential equation (12.12) to second order in p -1 . 


A particularly simple singular solution of Eq. (12.12) is obtained from j e by 
noting that the differential equation is invariant under the substitution 

1 ( 12 . 21 ) 


If this transformation is applied to the asymptotic form (12.20), we obtain a linearly 
independent solution that can, for large positive p, be written in the form 


n { (p) ~ 


sin[p — (€ + 1 ) tt/2] 
P 


(P » €) 


( 12 . 22 ) 


This particular singular solution of the radial equation, being asymptotically out of 
phase by tt/2 compared to the regular solutions j { (p), is sometimes distinguished as 
“the” irregular solution, although any linear combination of j t and n { is also singular 
at the origin. 

To exhibit the behavior of n ( near the origin, it is merely necessary to subject 
the expression (12.17) for j e to the transformation (12.21). We see that the singularity 
n e at the origin is contained in the leading term which is proportional to z _€_1 . Its 
coefficient is most easily computed by applying Eq. (3.7) to the two solutions 

= UP)Y? and = n t (p)Y? 


of the Schrodinger equation. Since if/ x and if/ 2 correspond to the same energy, we 
infer from (3.7) by application of Gauss’ divergence theorem that 


je(p) 


dn e (p) 

dp 


djfip) 

dp 


n e(p) 


— constant 


(12.23) 


This expression is the analogue of the Wronskian for the one-dimensional Schro- 
dinger equation. Substitution of (12.20) and (12.22) into (12.23) shows that the 
constant has value unity for large p. Hence, its value must also be unity as p -» 0. 
Using the approximation (12.17) for j ( near the origin, we derive from (12.23) for 

z ~ 0 


n e (z) = 


2 € • V. 


+ 0(z~ t+1 ) 


(12.24) 


The function n e (z) is known as the spherical Neumann function. 
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Exercise 12.3. Show that for any two solutions, R r and R 2 , of the radial equa- 
tion (12.5), the condition 

t 2 (r 1 — constant (12.25) 

\ dr dr J 

holds. Check this for the free particle as r — > °° and r -» 0. 0 

Two other useful singular solutions of Eq. (12.12) are the spherical Hankel 
functions of the first and second kind, defined by 

h c e\z) = jfiz ) + infiz) (12.26) 

and 

hf\z) = jfiz) - infiz) (12.27) 

The generic name for the solutions of Bessel’s equation (12.14) is cylinder functions, 
and the solutions of (12.12) are known, paradoxically, as spherical cylinder func- 
tions. The information we compile in this section about these special functions will 
be used in Section 12.3 and in Chapter 13 on scattering. 

Like the Neumann function, the Hankel functions diverge as near the 

origin: 

h^\z) = z ~‘~' + oa2) fe~"'> 02.28) 


and their asymptotic behavior for large positive p is seen from (12.20) and (12.22) 
to be 


h ( e\p) - ~ exp {i[p - (€ + 1)tt/2]} 
P 

hf\p) = - exp {-i[p - (€ + 1)7t/2]} 
P 


(12.29) 


The explicit forms of the spherical Bessel, Hankel, and Neumann functions for € = 
0, 1 and 2, are given below: 


jo(z) = 


sin z 


jfiz) 


sin z cos z 


jfiz . ) = 


nfiz) = 


-t I sin z y cos z 

V Z Z Z 


cos z 


nfiz) 


cos z sm z 


1 


n 2 (z) = - ^ cos z 2 sin z 


z 2 


,iz 


ho\z) = ~i~, hfifiz) = I --j 

z \ z Z) 


h?Xz) = ( ~ - 4 + - 


3/ 3 i 

zr z 2 z> 


(12.30) 


(12.31) 


(12.32) 


hfXz) = i 


h?(z) = 


3 i 


h?Xz) = 


~2 ~ ~ e 
z 


T 2 


- \e 


(12.33) 
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Exercise 12.4. Verify that for E < 0 none of the solutions of the free particle 
radial equation are physically acceptable owing to their asymptotic behavior. 

The regular radial eigenfunctions of the Schrodinger equation for V = 0 con- 
stitute a complete set, as a consequence of a fundamental theorem concerning Sturm- 
Liouville differential equations, 1 of which (12.12) is an example. Hence, we have 
before us two alternative complete sets of eigenfunctions of the free particle Ham- 
iltonian . They arfe the plane waves e !k r and the spherical waves j e (kr)Y™(6, <p), where 
hk = V2 m E. Both sets exhibit an infinite degree of degeneracy, but for a given 
value of the energy, the number of plane waves is indenumerable, while the number 
of spherical waves is denumerable, corresponding to the countability of the integer 
quantum numbers i and m. Nevertheless, these two sets of eigenfunctions are equiv- 
alent, and one kind must be capable of expansion in terms of the other, posing the 
problem of determining the coefficients in the expansion 

co € 

e ‘ kr = E I c^ m j ( (kr)Y'l'(6, <p) 

€ = 0 m=-t 

Actually, it is sufficient to specialize this relation to the case where k points along 
the z axis and consider the expansion 

oo 

e ikz = e ikrcose = X a ( j t {kr)P,{ cos 6) 

e=o 

From the orthogonality and normalization properties of Legendre polynomials we 
obtain (with £ = cos 9) 

2t + l r +1 

a*Mkr) = — — J_ i e** />,(£» (12.34) 

which we compare with (12.19) to establish the identity 

oo 

e ikz = X (2€ + l)i e j ( (kr)P ( (cos 6) (12.35) 

£^2 

This formula is especially useful in scattering theory. The more general expansion, 
with k pointing in an arbitrary direction, is obtained from (12.35) by use of the 
addition theorem for spherical harmonics: 


oo 

e' kr = X (2€ + l)i e jf(kr)P e (k • f) 

e=o 

oo + £ 

— 4ir X X i e j e (kr)[Y?(P, a)]*Y?(9, cp) 

e=o m=-e 


(12.36) 


where a and (3 denote the spherical polar coordinates of the vector k. 

A useful asymptotic approximation to Eq. (12.36) is derived by substituting 
(12.20) on the right-hand side. For kr » 1 we get 


- = S ( 2<? + i )i’ coslkr ~ (t + 1)j/2] r t( t • f) 

f=o kr 


( = 0 
„ ikr <» 


ikr “> — tkr 1:0 

= ^ 7 - E ( 2 € + l)P,(k • f) - — - X ( 2 € + l)(-l//yk . f) 

2 ikr f=o 2 ikr t = o 


(12.37) 


! Morse and Feshbach (1953), p. 738. 
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vhich by use of Eq. (11.103) reduces to 


e 


ik-r 


2 ire ikr 
ikr 


8(k, r) - 


2t Te~ ikr 
ikr 


8(k, -f) 


(12.38) 


i very convenient formula in scattering theory. It is seen that the leading term in 
he asymptotic expansion of the plane wave <?' k r contributes only in the forward and 
>ackward directions, which is a physically reasonable result. 


The Spherical Square Well Potential. The spherically symmetric square well 
n three dimensions is of interest because it is mathematically straightforward and 
ipproximates a number of real physical situations. Unlike the Coulomb potential, 
vhich gives rise to infinitely many discrete energy levels for bound states, the square 
veil, owing to its finite range and finite depth, possesses only a finite number of 
;uch levels. 

A square well is a central potential composed of two constant pieces: V = — V 0 
or r < a and V = 0 for r > a (with V 0 > 0). The particle is free inside and outside 
he well, and subject to a force only at the discontinuity at r = a. In this section, 
he emphasis will be on the bound states of a particle in such a potential. The radial 
vave equation for a state of angular momentum € is 


h 2 d_ ( 2 dR\ 
2 mr 2 dr\ dr j 

JL ± ( r 2 

2 mr 2 dr \ dr J 


+ 

+ 


h 2 m + 1 ) 

2mr 2 

fi 2 l(l + 1) 

2 mr 2 


R — (E + V 0 )R for r < a 
R — ER for r > a 


(12.39) 

(12.40) 


? or bound states — U 0 < £ < 0. 

The condition of regularity at the origin again restricts us to the spherical Bessel 
"unction for the solution inside the well. All the results of Section 12.2 apply pro- 
dded that we take into account the fact that E must be replaced by the kinetic energy, 
S — V — is + V 0 - Thus 


R(r) = Aj ( 


2m(E + Vo) 


h 2 


for r < a 


(12.41) 


Outside the well we must exclude any solution of (12.40) that would increase 
;xponentially at large distances. Since E < 0 for bound states, (12.40) has the same 
solutions as (12.10), but k is now an imaginary number. If we define 



t is easily verified from the asymptotic forms in Section 12.2 that only the Hankel 
"unction of the first kind decreases exponentially. The eigenfunction outside the well 
nust thus be of the form 


R(r) - Bh\ l) 



for r > 0 


(12.42) 


rhe interior and exterior solutions must be matched at r ~ a. In conformity with 
he analogous one-dimensional problem (see Section 3.5), the radial wave function 
md its derivative are required to be continuous at the discontinuity of the potential, 
dence, the logarithmic derivative, {HR) dR/dr or (1/m) duldr , must be continuous. 
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This condition, applied to (12.41) and (12.42) yields an equation for the allowed 
discrete energy eigenvalues: 2 



j' e (Va 2 - k 2 a) 
j/.V ol 1 - k 2 a) 


h’ t m {Ka) 

h ( t\Kd) 


(12.43) 


where a 2 = 2m VJft 2 

The solutions for positive E are asymptotically oscillatory and correspond to 
scattering states in which the particle can go to infinity with a finite kinetic energy. 
They will be studied in Chapter 13. 


Exercise 12.5. Compare the energy eigenvalues for S states in the three- 
dimensional square well with the energy eigenvalues of a one-dimensional square 
well of the same depth and width. 

Exercise 12.6. If H is the sum of a Hermitian operator H 0 and a positive 
definite perturbation V, prove by a variational argument that the ground state energy 
of H 0 lies below the ground state energy of H. Apply this theorem to prove that in 
a central potential the ground state of a bound particle is an S state. 


Exercise 12.7. 


Show that a spherical square well has no bound state unless 


V 0 a 2 > 


h 2 ir 2 

8m 


4 . The Radial Equation and the Boundary Conditions. We now return to a gen- 
eral discussion of the radial equation for central forces. From Section 12.1 we know 
that the solutions of the Schrodinger equation can be constructed as 

<Kr, 0, <p) = — Y?(d, <p) (12.44) 

r 


Since r does not change under reflection, these wave functions have the same parity 
as Y™. Hence, for even i we have states of even parity, and for odd € we have states 
of odd parity. 

The radial wave function u{r) must satisfy the equation 


h 2 d 2 u 
2m dr 2 


fi 2 l(l + 1) 

2 mr 2 


+ V(r) 


u = Eu 


(12.45) 


The general principles of quantum mechanics require that the eigenfunctions (12.44) 
be normalizable. Since the spherical harmonics are normalized to unity, the eigen- 
functions corresponding to discrete eigenvalues must satisfy the condition 



dr dCl = 



u*u dr 


1 


(12.46) 


If E lies in the continuous part of the spectrum, the eigenfunctions must be nor- 
malized in the sense of (4.33), or 



u E *u E ' dr — 


S(E - E') 


(12.47) 


2 Schiff (1968), p. 86, gives useful recurrence relations for spherical cylinder functions and their 
derivatives. 
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Most situations of practical interest are covered if we assume that V(r) is finite 
everywhere except possibly at the origin and that near r = 0 it can be represented 
by 3 

V(r) - cr a (12.48) 


with a an integer and a > — 1. Furthermore, we assume that V — > 0 as r — » oo. 

We must not forget that, since division by r is involved, (11.111) is not a 
representation of the kinetic energy at the coordinate origin. For the same reason 
(12.45) is valid only for r ^ 0 and must be supplemented by a boundary condition 
at r — 0. Without going into detail, we note that the appropriate boundary condition 
is obtained by demanding that the Hamiltonian, or energy, operator must be self- 
adjoint in the sense of (4.35). This is the condition which consistency of the prob- 
ability interpretation of quantum mechanics imposes on the eigenfunctions of H. 

Applying this requirement to the operator 


H = 


JLl (,**.) 

2 mr 2 dr \ dr) 


+ 


L 2 

2 mr 2 


+ V(r) 


we find, by integrating by parts, that any two physically admissible eigensolutions 
of (12.45) must satisfy the condition 


lim 

r — *0 




= 0 


(12.49) 


In applications, this condition usually may be replaced by the much simpler one 
requiring that u{r) vanish at the origin: 


w(0) - 0 (12.50) 

In most cases, this boundary condition singles out correctly the set of eigenfunctions 
that pass the test (12.49), but mildly singular wave functions are occasionally en- 
countered (e.g., in the relativistic theory of the hydrogen atom, Section 24.9). 

If in the immediate vicinity of the origin V can be neglected in comparison with 
the centrifugal term, which for r — > 0 increases as 1/r 2 , (12.45) reduces near r = 0 
to 


d 2 u 

dr 2 


+ 1 ) 


u — 0 


(12.51) 


for states with € + 0. Potentials of the form (12.48) at small r, including the square 
well and the Coulomb potential, are examples of this. The general solution of (12.51) 
is 


u = Ar t+1 + Br~ e (12.52) 

Since € > 1, the boundary condition (12.49) or (12.50) eliminates the second so- 
lution; hence, B — 0. Thus, for any but S states, u(r ) must be proportional to r e+1 
at the origin and i jj must behave as r e . Hence a power series solution of (12.45) must 
have the form 


u(r) — r (+1 (a 0 + a x r + a 2 r 2 + ■ ■ •) (12.53) 


3 For potentials that are more singular at the origin, see Morse and Feshbach (1953), pp. 1665- 


1667. 
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If T - 0 (S states), the terms in (12.45) containing V and E cannot be neglected, 
and a separate investigation is required to obtain the behavior of the wave function 
near the origin. Even then the form (12.53) remains applicable for S states in most 
cases. 

Exercise 12.8 . For a potential V = —Clr and angular momentum i = 0, show 
that the general solution of (12.45) is of the form 

u = j r 2 + • • J + B(l - C log r +-■ • •) (12.54) 

for small values of r and infer that for S states again we must require that B — 0. 


Assuming that the potential energy vanishes at great distances, the radial equa- 
tion (12.45) reduces to 



2m E 

~W 


u — 0 


(12.55) 


as r — > 00 . Equation (12.55) possesses oscillatory solutions for positive E and ex- 
ponential solutions for negative E, with the increasing exponential excluded by the 
condition that must be normalizable in the sense of (12.46) or (12.47). 

If E < 0, the eigenfunctions have the asymptotic behavior 

( V-2 m E 
u(r ) = exp I r 


representing spatially confined, or bound, states. The boundary conditions will in 
general allow only certain discrete energy eigenvalues. 

For bound states, the radial equation is conveniently transformed by the intro- 
duction of the dimensionless variable 


P = 



(12.56) 


Sometimes it is also convenient to remove from the unknown dependent variable 
the portions that describe its behavior at r — 0 and r = °°. Thus, we introduce a 
new function w(p ) by setting 


w(p) = p ( + l e “w(p) 


(12.57) 


Substituting this expression into (12.45), we obtain 


d 2 w ( i + 1 A dw 
dp * + \~P~ / dp 


V 

E 


2(1 + 1 ) 

P 


w — 0 


(12.58) 


Of the solutions of this equation, we seek those that satisfy the boundary condition 
at infinity and at the origin. 


5. The Coulomb Potential. Let us now suppose that V is the potential energy 
of the Coulomb attraction between a fixed charge Ze and a moving particle of 
charge —e. 



r 


(12.59) 
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For the hydrogen atom, — e is the charge of the electron and Z = 1. According to 
the discussion of the last section and especially Eq. (12.52), the radial wave function 
u{r) must behave as r t+l near the origin. This is also true for S states, as shown in 
Exercise 12.8. 

The energy levels and eigenfunctions of bound states in the nonrelativistic the- 
ory will be discussed in this section and the next. The energy continuum ( E > 0) of 
a particle in a Coulomb potential is the subject of Section 13.8 on scattering. 

For convenience we introduce a dimensionless parameter 


Ze 2 K _ [2m Ze 2 

Po ~ |£j “ y \E\ h 


(12.60) 


such that 


E = _P 
V Po 


(12.61) 


For this potential, the differential equation (12.58) can then be written as 


d 2 w „ dw 

p — + 2(€ + 1 - p) — + [p 0 - 2(€ + 1 ))w = 0 


dp 


dp 


(12.62) 


A simple two-term recursion relation is found if we expand w(p) in a power series: 

w(p ) = a 0 + a x p + a 2 p 2 + • • • (12.63) 

We substitute (12.63) into (12.62) and equate to zero the coefficient of p k . The result 
is, for k > 0, 

(k + 1 )k a k+ 1 + 2(i + 1 )(k + 1 )a k+l -2 k a k + [p 0 ~ 2(€ + 1 )]a k = 0 


or 


a k + 1 _ 2(y( + i + i) - po 
a k (k + 1 )(jfc + 2€ + 2) ^ } 

This recursion relation shows that, for k > 0, the coefficients a k are proportional to 
a 0 A 0. This power series must terminate at some finite maximum power. If it failed 
to do so, all terms with k > (1/2 )p 0 - (€ + 1) would have the same sign. Further- 
more, it is easy to verify that if k > C p 0 + 2 (C — 1)(€ + 1), where C is a constant, 
C > 1/2, the ratio a k+l /a k is always greater than the corresponding ratio of coeffi- 
cients in the power series expansion of e < 2_1/c ^. if c is chosen to be greater than 1, 
it follows from (12.57) that for large values of p the radial eigenfunction u(p) di- 
verges faster than e°~ 1/C)p . Such a strongly divergent wave function is not quadrat- 
ically integrable and is not acceptable to represent the physical system. Flence, the 
series (12.63) must terminate, and w(p) must be a polynomial. Let us suppose its 
degree to be N, that is, a N+1 = 0, but a N A 0. Equation (12.64) leads to the condition 


Po = 2 (N + € + 1) 


(12.65) 


where € = 0, 1, 2 . . . and N = 0, 1, 2 

Exercise 12.9. Assume, contrary to the conventional procedure, that the radial 
eigenfunctions for a bound state of the hydrogen atom can be written as 

u = p k e~ p ^b 0 + ^ 


( 12 . 66 ) 
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Obtain the recursion relations for the coefficients, and show that the boundary con- 
ditions give the same eigenvalues and eigenfunctions as usual. [A series like (12.66) 
in descending powers of r can be useful for calculating approximate radial eigen- 
functions, if V behaves as the Coulomb potential at large, but not at small, distances.] 

It is amusing to contemplate that as innocuous an equation as (12.65) is equi- 
valent to the Balmer formula for the energy levels in hydrogenic atoms. To see this, 
we merely substitute (12.60) into (12.65) and define the principal quantum number 

n = N+Z+\ = y (12.67) 

The result is 

( 12 . 68 ) 

As is well known from the elementary Bohr theory, 

(12.69) 

sets the length scale in the quantum description of the hydrogenic atom. The length 
a is termed the first Bohr radius of hydrogen if m = m e is the mass and — e the 
charge of the electron. Its numerical value is 

a = 0.529177 X 1(T 8 cm 

Using this quantity, we can write the energy simply as 

zV 

E n = -—2 ( 12 - 7 °) 

2 an 

Also, we see that 

Z Z 

k — — and p = — r (12.71) 

na na 

Since N is by its definition a nonnegative integer, it is obvious from (12.67) that n 
must be a positive integer, 

n > 1 (12.72) 

The ground state of the hydrogen atom corresponds to n — 1, £ = 0, with an energy 
of approximately —13.6 eV. There are infinitely many discrete energy levels; they 
have a point of accumulation at E 0 — 0 (n — » °o). 

The fact that the energy depends only on the quantum number n implies that 
there are in general several linearly independent eigenfunctions of the Schrodinger 
equation for hydrogen corresponding to the same value of the energy. In the first 
place, 2£ + 1 different eigenfunctions of the same energy are obtained by varying 
the magnetic quantum number m in integral steps from — £ to £. Second, there are 
n values of £(£ = 0, 1, 2, . . . , n — 1) consistent with a given value of n. Hence, all 
energy levels with n > 1 are degenerate, and the number of linearly independent 
stationary states of hydrogen having the energy (12.68), the degree of degeneracy, 
is 

E (2€ + 1) = n 2 

€=0 




(12.73) 
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For example, in standard spectroscopic notation ( n followed by the symbol for €), 
the first excited level of hydrogen (n = 2) is fourfold degenerate and consists of the 
2 S state and three 2 P states. (The degeneracy is doubled if the spin is taken into 
account.) 

The occurrence of degeneracy can often be ascribed to some transparent sym- 
metry property of the physical system. For instance, the degeneracy with respect to 
magnetic quantum numbers is clearly present for any central potential. It has its 
origin in the absence of a preferred spatial direction in such systems, and it reflects 
their invariance with regard to rotations about the origin. 

The degeneracy of energy eigenstates with different values of i is not a general 
property of central forces. It occurs for the Coulomb potential because the Hamil- 
tonian of this system is invariant under a less obvious symmetry, which generates 
an additional constant of the motion. Any departure from a strict 1/r dependence of 
the potential removes this degeneracy. 

Exercise 12.10. Show that the addition of a small 1/r 2 term to the Coulomb 
potential removes the degeneracy of states with different €. The energy levels are 
still given by a Balmer-like formula (12.68), but n differs from an integer by an 
^-dependent quantity, the quantum defect in the terminology of one-electron (alkali) 
spectra. 


The new constant of the motion is the quantum-mechanical analogue of the 
Runge-Lenz vector (apparently first introduced by Laplace): 4 

K = r (LXp-pXL)f- (12.74) 

2 me r 

This is a vector operator that commutes with the Hamiltonian for the hydrogen atom, 

p 2 e 2 

H = f . (12.75) 

2m r 


and has the properties 

L • K = K • L = 0 (12.76) 

Exercise 12.11. Show that K satisfies the condition (11.19) for a vector op- 
erator, that it commutes with the Hamiltonian (12.75), and that (12.76) holds. 


For our purposes, the crucial property of the components of K is that they satisfy 
the commutation relations 


[K x , K y ) = ih 



[K y , K z ] = ih 


[K z , K x ] = 




(12.77) 


4 See Goldstein (1980), p. 102. 
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From this relation it is seen that the vector operator 



(12.78) 


defined in the subspace of bound-state vectors with energy E < 0 satisfies the com- 
mutation relations 

[A x , A y ] = ihL z , [A y , A z ] = ihL x , [A z , A x \ = ihL y (12.79) 


Exercise 12.12. Check the commutation relations (12.77). 

An important identity is obtained from the definition (12.74) through some op- 
erator manipulations: 

2 H 

K 2 = — t (L 2 + h 2 ) + 1 (12.80) 

me 

Restricting ourselves to the subspace of a definite (negative) bound-state energy E, 
we may write this, according to (12.78), as 

. o , o o me 4 

A 2 + L 2 + h 2 = (12.81) 

2 E 

Finally, it is convenient to introduce the operators 

Ji = ^ (L + A) and J 2 = ^ (L - A) (12.82) 

each of which satisfies the angular momentum commutation relations. They have 
the property 

[Jr, J 2 ] - 0 and J \ = i\ (12.83) 

Exercise 12.13. Prove that J x and J 2 satisfy the commutation relations (1 1.24) 
and the conditions (12.83). 


Equation (12.81) is then transformed into 

2(J? + JD + ft 2 = 


4 

me 
2 E 


(12.84) 


Any state in the subspace spanned by the energy eigenstates corresponding to an 
eigenvalue E must be an eigenstate of J 2 = J\. From Section 1 1.2 we know that this 
operator assumes the eigenvalues j(j + l)ft 2 , where j can be any nonnegative integer 
or half-integer. From (12.84) we therefore deduce that 


E = 


4 

me 

2h\2j + l) 2 


(12.85) 


which is identical with the standard form (12.68) for the energy levels of the non- 
relativistic hydrogen atom, if we identify the positive integer 2 j + 1 with the prin- 
cipal quantum number n. 

The two commuting “angular momentum” operators Ji and J 2 can be linked 
to “rotations” in a four-dimensional Euclidean space. The invariance of the Ham- 
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iltonian for the Coulomb interaction under these “rotations” signals the new sym- 
metry that accounts for the degeneracy of the energy levels of the hydrogen atom 
with different € values. This symmetry also explains why it is possible to separate 
the corresponding Schrodinger equation in parabolic as well as spherical coordi- 
nates. We will take advantage of this property in discussing the positive-energy, 
unbound or scattering, eigenstates in Section 13.8. In the meantime, we return in 
the next section to the hydrogenic energy eigenfunctions in spherical polar coordi- 
nates. 


6. The Bound-State Energy Eigenfunctions for the Coulomb Potential. This 
section summarizes the most important properties of the radial bound- state wave 
functions for the attractive Coulomb potential. These functions can be expressed in 
terms of confluent hypergeometric functions, as can be seen when the value given 
by (12.65) for p 0 is put into (12.64): 


flfc+i _ 2(k - N) 

a k ~ (k + l)(fc + 2 € + 2 ) 


( 12 . 86 ) 


The confluent hypergeometric function has already been defined in (8.87) as 


i 


Ffa; c; z) = 1 + - 7 - + 
c 1 ! 


a(a + 1 ) z 2 
cic + 1 ) 2 ! 


(12.87) 


Comparing its coefficients with (12.86), we see that 

w(p) oc ^(-N; U + 2; 2 p) (12.88) 

This can also be seen by comparing the differential equation 

d 2 w dw 

z — y + (c - z) — aw = 0 (12.89) 

dz dz 

which w = 1 Ffa; c; z) satisfies, with the radial equation (12.62) if the latter by the 
use of (12.67) is cast in the form 

p d 2 w „ dw 

y- ~ry F (f + \ — p) — h Nw — 0 (12.90) 

2 dp dp 

The complete normalized wave function can be obtained only if we know the value 
of the normalization integral for the confluent hypergeometric functions. It can be 
shown that 5 



e X x c h Ffa; c; x )] 2 dx = 


(c - 2fl)(-a)![(c - l)!] 2 
(c — a — 1 )! 

(c - 2a)T(l - a)[T(c)] 2 


T(c — a) 


(12.91) 


Since the spherical harmonics are normalized to unity, the complete eigenfunction 
if/(r, 6 , <p) of (12.44) is normalized to unity if 



5 Bethe and Salpeter (1957), Section 3. 
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Since 


p = Kr, a — —N, c — 2€ + 2, 
c — 2a — 2(£ + 1 + N) — 2 n, c — a — 1 — n + £ 

we can write the hydrogenic eigenfunctions, normalized to unity, as 


»- -I I/O 

e~*'{2Kff , <» + tj! 

9 ’ v) 35 [< 2 ") S ( „ - t - ,). 

,F,(-n + € + 1; 2€ + 2; 2 icr)Y?(6, <p) 


(12.92) 

The radial eigenfunctions for the three lowest energy eigenvalues are plotted in 
Figure 12.1 as a function of Zrla. 

Exercise 12.14. Calculate the radial hydrogen energy eigenfunctions for n = 
4 and 5 explicitly, and sketch them graphically. 6 

The polynomial 1 F 1 (— N; c; z ) of degree N and positive integral argument c is 
proportional to the associated Laguerre polynomials of classical mathematical phys- 
ics. The connection is established by the relation 


7-» / , Hi - a)r(c) TC _ 1/ s 

lFfa; c; z ) = — rj- T_ a (z) 

[T(c - a)f 

An elementary definition of the associated Laguerre polynomial is 

L p q - P {z) = (-1 Y~L q (z) 
and the Laguerre polynomial of order q is 

L° q (z) = L q (z) = e z ~ (e~ z z q ) 


(12.93) 


(12.94) 


(12.95) 


Exercise 12.15. Show that the generating function for the associated Laguerre 
polynomials (for |s| ^ 1) is 


1 — s 


Ln(z) 


(12.96) 


(i - *r +i „= 0 (« + P )i v ' 

A few comments concerning the properties of the hydrogenic eigenfunctions 
are appropriate. The wave function possesses a number of nodal surfaces. These are 
surfaces on which ifj = 0. For these considerations, it is customary to refer instead 
of (12.92) to the real-valued eigenfunctions 


</W, « r*e- Kr M-n + € + 1; 2Z + 2; 2 K r)Pf(cos 0)^ . 


cos mcp 


sin mcp 


(12.97) 


6 Brandt and Dahmen (1985), Section 12.4. For good early graphic representations see White 
(1931). See also Pauling and Wilson, Jr. (1935), Section 5.21. 
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Figure 12.1. (continued) 
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There are € - m values of 6 for which P”( cos 6) vanishes; cos m<p and sin m<p vanish 
at m values of the azimuth, and the confluent hypergeometric function l F 1 vanishes 
at n - £ - 1 values of r. For £ + 0, r e has a node at r = 0. Hence, except in S 
states, the total number of nodal surfaces is n, if r = 0 is counted as a surface. 

Important consequences follow from the fact that the wave function vanishes 
at the origin except when £ = 0 (S states). For instance, the capture by a nucleus of 
an atomic electron or any other orbiting negatively charged particle can occur with 
appreciable probability only from a level with £ = 0, because these are the only 
states for which i/j has a nonzero finite value at the position of the nucleus. Similarly, 
in the phenomenon of internal conversion an atomic 5 electron interacts with the 
nucleus and is imparted enough energy to be ejected from the atom in an autoion- 
izing transition. 

The quantum mechanical significance of the Bohr radius a = h 2 /me 2 , can be 
appreciated by observing that the wave function for the Is ground state is 


'Pififii.r, 6, <p) = 




(12.98) 


The expectation value of r in this state is 

<r > = 4 (f ) 3 Jo" exp(-f = 1 1 (,2.99) 

The maximum of the probability density for finding the particle in the ground state 
with a radial separation r from the nucleus, i.e., the maximum of the function 



is located at a/Z. 


Exercise 12.16. Evaluate the width of the probability distribution for the radial 
coordinate r in the ground state of the hydrogenic atom by calculating the uncertainty 
Ar. 


The Bohr radius a is inversely proportional to the mass of the particle that 
moves around the nucleus. Hence, a muon, pion, or kaon in a mesic atom, or an 
antiproton that has been captured by a nucleus, is much closer to the nucleus than 
an electron is in an ordinary atom. The finite size of the nucleus will thus be expected 
to affect the discrete energy levels of exotic atoms appreciably, whereas nuclear size 
effects are very small for electronic states in the bound states of ordinary atoms. 

Many other corrections, of course, must be taken into account when comparing 
the simple Balmer formula (12.68) with the amazingly accurate results of modern 
atomic spectroscopy. Most obviously, we must correct the error made in assuming 
that the nucleus is infinitely massive and therefore fixed. Since for central forces the 
actual two-body problem can, in the nonrelativistic approximation, be replaced by 
an effective one-body problem, if we substitute the reduced mass m 1 m 2 /(m l + m 2 ) 
for m, this correction can be applied accurately and without difficulty. It gives rise 
to small but significant differences between the spectra of hydrogen and deuterium. 
For a positronium atom, composed of an electron and a positron, which have equal 
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masses, all energy levels are changed by a factor 1/2 compared to hydrogen. Further, 
and often more significant, corrections are due to the presence of the electron spin 
and the high speed of the electron, which necessitate a relativistic calculation (Sec- 
tion 24.9); hyperfine structure effects arise from the magnetic properties of the nu- 
cleus; and finally, there are small but measurable effects owing to the interactions 
between the electron and the electromagnetic field (Lamb shift). 7 * The theory of some 
of these effects will be discussed later in this book; others lie outside its scope. But 
all are overshadowed in magnitude by the basic gross structure of the spectrum as 
obtained in this chapter by the application of nonrelativistic quantum mechanics to 
a charged particle subject to the Coulomb potential. 


Problems 


1. Compute (or obtain from mathematical tables) and plot the 10 lowest energy eigen- 
values of a particle in an infinitely deep, spherically symmetric square well, and 
label the states by their appropriate quantum numbers. 

2. If the ground state of a particle in a spherical square well is just barely bound, show 
that the well depth and radius parameters V 0 and a are related to the binding energy 
by the expansion 


2 mV 0 a 2 


77 

—— + 2 Ka + 
4 




(ko ) 2 + ■ ■ ■ 


where 

fiK = V— 2 mE 


The deuteron is bound with an energy of 2.226 MeV and has no discrete excited 
states. If the deuteron is represented by a nucleon, with reduced mass, moving in a 
square well with a = 1.5 fermi, estimate the depth of the potential. 

3. Given an attractive central potential of the form 

V(r) = -V 0 e~ r,a 


solve' the Schrodinger equation for the 5 states by making the substitution 

£ = e~ r/2a 


Obtain an equation for the eigenvalues. Estimate the value of F 0 > if the state of the 
deuteron is to be described with an exponential potential (see Problem 2 for data). 


4, Show that, if a square well just binds an energy level of angular momentum 
£ (=£ 0), its parameters satisfy the condition 


je- i 


2mV 0 a 2 


ft 2 


= 0 


(Use recurrence formulas for Bessel functions from standard texts.) 

5. Assuming the eigenfunctions for the hydrogen atom to be of the form r l3 e~ ar Y%, 
with undetermined parameters a and (3, solve the Schrodinger equation. Are all 
eigenfunctions and eigenvalues obtained this way? 

6. Apply the WKB method to an attractive Coulomb potential, and derive an approx- 
imate formula for the 5-state energy levels of hydrogen. 


7 Gross (1993). For a detailed account of the spectrum of hydrogenic and helium-like atoms, see 

Bethe and Salpeter (1957). 
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7. Compute the probability that the electron in a hydrogen atom will be found at a 
distance from the nucleus greater than its energy would permit on the classical 
theory. Make the calculation for the n = 1 and 2 levels. 

8. Calculate the probability distribution for the momentum of the electron in the ground 
state of a hydrogen atom. Obtain the expectation value of pi from this or from the 
virial theorem. Also calculate { x 2 ) from the ground state wave function, and verify 
the uncertainty relation for this state. 

9. Solve the Schrodinger equation for the three-dimensional isotropic harmonic oscil- 
lator, V = (1/2 )mo) 2 r 2 , by separation of variables in Cartesian and in spherical polar 
coordinates. In the latter case, assume the eigenfunctions to be of the form 

<Kr, 6, <p) = r e exp(-^ r 2 )f(r)Y?(9, cp) 


and show that f(r) can be expressed as an associated Laguerre polynomial (or a 
confluent hypergeometric function) of the variable m(or 2 /h with half-integral 
indices. Obtain the eigenvalues and establish the correspondence between the two 
sets of quantum numbers. For the lowest two energy eigenvalues, show the relation 
between the eigenfunctions obtained by the two methods. 

10. For the isotropic harmonic oscillator of Problem 9, obtain a formula for the degree 
of degeneracy in terms of the energy. For large energies (large quantum numbers), 
compare the density of energy eigenstates in the oscillator and in a cubic box. 

11. Starting with the radial equation 


h 2 cfu 
2m dr 2 


h 2 €(i + 1) Ze 2 


Imr 1 


u — 


u = Eu 


for the hydrogenic atom, show that the transformation 

r = dr 2 , u — VF m, 

produces an equation for u(r) that, with appropriate choices of the constants, is 
equivalent to the radial equation for the isotropic oscillator. Exhibit the relation 
between the energy eigenvalues and the radial quantum numbers for the two systems. 

12. The initial ( t = 0) state of an isotropic harmonic oscillator is known to be an 
eigenstate of L z with eigenvalue zero and a superposition of the ground and first 
excited states. Assuming that the expectation value of the coordinate z has at this 
time its largest possible value, determine the wave function for all times as com- 
pletely as possible. 

13. Solve the energy eigenvalue problem for the two-dimensional isotropic harmonic 
oscillator. Assume that the eigenfunctions are of the form 

= P e exp (~^ p2 )^ exp(±t€<p) 

where p and <p are plane polar coordinates and i is a nonnegative integer. Show 
that f(p ) can be expressed as an associated Laguerre polynomial of the variable 
m(op 2 /h and determine the eigenvalues. Solve the same problem in Cartesian coor- 
dinates, and establish the correspondence between the two methods. Discuss a few 
simple eigenfunctions. 

14. Apply the variational method to the ground state (€ = 0) of a particle moving in an 
attractive central potential V(r) = Ar n (integer n > —1), using 

R{r) = <T' 3r 

as a trial wave function with variational parameter /3. For n = — 1 and +2, compare 
the results with the exact ground state energies. 
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15 . Apply the variational method to the ground state (€ = 0) of a particle moving in an 
attractive (Yukawa or screened Coulomb or Debye) potential 

-r/a 

V(r)=-V 0 —- (V 0 > 0) 

r/a 

Use as a trial function 

R(r) = e~ yr/a 

with an adjustable parameter y . Obtain the 4 ‘best” trial wave function of this form 
and deduce a relation between y and the strength parameter 2 mV 0 a 2 th 2 . Evaluate y 
and calculate an upper bound to the energy for 2 mV 0 a 2 lh 2 = 2.7. 

Are there any excited bound states? 

Show that in the limit of the Coulomb potential (U 0 — » 0, a — » °°, V 0 a finite) 
the correct energy and wave function for the hydrogenic atom are obtained. 

16 . Using first-order perturbation theory, estimate the correction to the ground state 
energy of a hydrogenic atom due to the finite size of the nucleus. Under the as- 
sumption that the nucleus is much smaller than the atomic radius, show that the 
energy change is approximately proportional to the nuclear mean square radius. 
Evaluate the correction for a uniformly charged spherical nucleus of radius R. Is the 
level shift due to the finite nuclear size observable? Consider both electronic and 
muonic atoms. 

17 . An electron is moving in the Coulomb field of a point charge Ze, modified by the 
presence of a uniformly charged spherical shell of charge —Z'e and radius R, cen- 
tered at the point charge. Perform a first-order perturbation calculation of the hy- 
drogenic 15, 25, and 2 P energy levels. For some representative values of Z = Z', 
estimate the limit that must be placed on R so that none of the lowest three energy 
levels shift by more than 5 percent of the distance between the unperturbed first 
excited and ground state energy levels. 



CHAPTER 13 


Scattering 


Much of what we know about the forces and interactions in atoms and r 
subatomic particles has been learned from collision experiments, in which 
atoms in a target are bombarded with beams of particles. Particles that are 
scattered by the target atoms are subsequently detected by devices that 
may give us the intensity as a function of the scattering angle (and 
possibly of the energy of the scattered particles, if inelastic processes are 
also involved). We begin the discussion with a general introduction to the 
concept of a cross section, since it constitutes the meeting ground 
between experimentalists and theoreticians for all varieties of collisions. 
We then establish the connection between calculated quantum mechanical 
amplitudes and measured cross sections for collisions in which elastic 
scattering is the dominant process. The stock-in-trade of scattering theory 
is developed: incoming and outgoing Green’s functions, quantum 
mechanical integral equations, the Born approximation, and partial wave 
and phase shift analysis. Later, in Chapter 20, we will take a second 
comprehensive look at collision theory in the context of the general 
principles of quantum dynamics, to be developed in Chapters 14 and 15. 1 

1 . The Cross Section. Some form of collision experiment is the most common 
tool for probing atomic and subatomic interactions. Collisions of nucleons with nu- 
clei at various energies reveal information about the nuclear forces and the structure 
of the nucleus. Electrons of high energy, hence short wavelength, are particularly 
well suited to determine the charge distribution in nuclei, and indeed within nucle- 
ons. Electrons and heavier projectiles of low energy are scattered from atoms to 
obtain data that can serve as input information for calculations of kinetic processes 
in gases where low-energy collisions predominate. And collisions of hadrons and 
leptons with protons tell us about interactions of which we have no other direct 
information. These are just a few examples of the utility of collisions in studying 
the internal structure of atoms and nuclei and the interactions that govern elementary 
particles. 

Usually, we know the nature of the particles used as projectiles, their momen- 
tum, and perhaps their polarization (defined in Chapter 16). The collision between 
a projectile and a target particle is sometimes referred to as a binary collision, in 
order to distinguish it from the kind of interaction that takes place when an incident 
beam interacts with a large number of target atoms, as happens in diffraction of 
electrons, neutrons, or even entire atoms from a lattice. The target particle is fre- 
quently at rest (or nearly so) before the collision, but its thermal motion cannot 
always be neglected. In some experiments, the target atoms are in gas jets, with 
controlled initial velocity, usually at right angles to the incident beam. In merging 
or colliding beam experiments, both projectile and target particles initially move 


’Goldberger and Watson (1964), Newton (1982), and Taylor (1972) are treatises on collision 
theory. Mott and Massey (1965) and Bransden (1983) apply the theory to atomic collision problems. 
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along the same direction, either parallel or antiparallel, and with controlled initial 
velocity. 

Experimentalists typically measure intensities and yields of certain collision or 
reaction products that result from the interaction of incident particles or waves with 
target “atoms,” upon which the projectiles impinge, although the “atoms” can be 
any kind of object (particle or wave) that affects the incident beam. The comparison 
between measurements and theoretical predictions is made in terms of the cross 
section for the particular process under consideration. Broadly speaking, in physics, 
a cross section measures the size of the effective area that must be hit, if an incident 
charged or neutral projectile particle is to cause a certain specified effect in a given 
target particle, which may also be either neutral (stationary target) or charged (col- 
liding beams). The relative velocity (collision energy) of the two interacting particles 
is usually specified at incidence. The experiment illustrated in Figure 13.1 involves 
a collimated homogeneous beam of monoenergetic particles moving in the same 
sharply defined direction toward the scatterer from a great distance. Good resolution 
in incident velocity (magnitude as well as direction) is prized in all collision exper- 
iments. The width of the beam is determined by slits, which, though quite narrow 
from an experimental point of view, are nevertheless very wide compared with the 
cross section. Experimentally, in the interest of securing “good statistics,” it is 
desirable to employ beams with high incident intensity, or luminosity. Yet the beam 
density must be low enough that it can safely be assumed that the incident particles 
do not interact with one another. A precise knowledge of the number of projectiles, 
either per pulse or per unit time, is essential. 

The interpretation of most experiments requires accurate knowledge of the 
physical properties of the target, especially the number of target particles that are 
exposed to the incident beam. If the target is solid, it is assumed (unless otherwise 
specified) that the beam is incident normal to the surface of the target. 

Prototypically, a cross section measurement is similar to the (“Monte Carlo”) 
determination of the size of the bull’s eye on a dart board by recording the successful 
hits among randomly thrown missiles directed blindly toward the target, provided 
we know the average number of projectiles striking the target per unit area. However, 
in the atomic and subatomic domain the cross section, which is a physical property 
of the target particle and its interaction with the projectile, generally bears no direct 
relation to the geometric size of the target particle. 



Figure 13.1. Sketch of a scattering experiment. A pulse of N P = I 0 A projectile particles is 
incident from the left with velocity v 0 . The beam intersects an area A of the target 
(thickness t and particle density ri) and encounters N r = ntA target particles. The detector 
D registers I x outcomes X, and the corresponding cross section is u x — I X /N T I 0 ■ 
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Let us suppose that we are interested in determining the likelihood of an out- 
come X in a binary collision. In measurements, the outcome X might refer to any of 
a number of physical processes, as for example: 

(a) Scattering, with or without loss of energy, into a well-defined solid angle 
tilted at a specified direction with regard to the direction of incidence. This direction 
is defined by the position of the detector. If the collision is not elastic, the energet- 
ically degraded projectile and the excitation of the target may be observed simul- 
taneously by a coincidence measurement of some outgoing radiation or particle 
emission. Sometimes the recoil of the target particle is observed. 

(b) Absorption of the incident projectile by the target, measured either through 
the attenuation of the incident beam or through observation of reaction products, 
e.g., the creation of new particles, the emission of radiation or particles, or chemical 
changes in the composition of the target. 


Let cr x stand for the cross section that corresponds to the outcome X. All cross 
sections have the dimension (units) of an area, although differential cross sections 
may be defined and quoted as “area per unit measure of X.” Let A denote the area 
of overlap of the beam of projectiles and the assemblage of target particles, measured 
at right angles to the relative velocity of the projectile-target system (Figure 13.1). 
(In the case of a stationary target, this is just the direction of the incident beam.) 
We assume that the experimental arrangement allows us to choose A » cr x . The 
dimensionless ratio cr x /A is the probability of the outcome X in a binary collision 
of a projectile particle with a target particle, provided that the presence of the other 
projectile and target particles does not affect the binary collision. Under these con- 
ditions if, within the common overlap area A, the assemblage of projectiles contains 
N P particles and the target assemblage contains N T particles, the total number I x of 
detectable outcomes with property X is the product: 


ov 

1 x = — N P N T 
A 


(13.1) 


Since experiments have to be conducted within a reasonable length of time, in 
laboratory practice steady and stable beams are characterized by the number of par- 
ticles per unit time. If the product N P N T is interpreted as the number of encounters 
between projectile and target particles per unit time, I x measures the rate or yield 
of emerging particles with outcome property X per unit time. If the detector is 
“ideal,” i.e., 100 percent efficient, the quantity I x is known as the counting rate for 
the process that is being observed. Expression (13.1) is directly applicable to the 
colliding beam geometry in particle accelerators, and the quantity L = N P N T /A is 
generally referred to as the luminosity (in units of sec -1 cm -2 ). The magnitude of 
the luminosity is critical if reasonable counting rates are to be achieved. (In typical 
high-energy particle accelerators, luminosities are of the order 10 32 sec -1 cm -2 for 
colliding beams.) 

In the more com m on geometry of stationary targets exposed to projectile beams, 
Eq. (13.1) is usually written in a less symmetric form. The ratio I 0 = N P /A, the 
incident intensity, is the number of particles incident on the target per unit area and 
unit time. The number N T of target particles exposed to the beam can be expressed 
as 


N r — ntA 


(13.2) 
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where n is the number of target particles per unit volume and t is the thickness of 
the target. If the binary collision between a projectile particle and any target atom 
is unaffected by the presence of the other target atoms, the product cr x nt is the 
fraction of the total area A over which beam and target particles interact that is 
effective in producing the outcome X. This must be less than unity: 


i 


(TV 

N t — — ntcr x < 
A 


1 


setting an upper limit to the thickness t. The target is said to be “thin” if 
t « 1 !na x . For a thin target, Eq. (13.1) can be reexpressed as 


ntcry = 


I x _ counting rate for detectable particles with property X 
AI 0 number of particles incident on the target per unit time 

l 

For the cross section cr x we thus obtain the formula 


(13.3) 


o- x 


ntAI () 


(13.4) 


The outcome X in a collision experiment may be a discrete physical character- 
istic, such as the “yes/no” choice in a nuclear reaction, which either does or does 
not occur. Similarly, a total scattering cross section measures only the occurrence 
of the scattering process, and not any specific feature of the scattered particle. On 
the other hand, X may stand for a physical process that is described by one or more 
variables x with continuous values, such as scattering in a particular spatial direction 
or with an energy loss that lies in a certain continuous range. The cross section is a 
measure of the probability that the outcome lies in the interval between x and x + dx 
and is usually proportional to dx. It is then customary to write this cross section in 
the form dcr x and to define the ratio da x ldx as the differential cross section for the 
outcome X with value x. 

An example is the differential scattering cross section, da/dfl, corresponding 
to scattering into a solid angle dCl, defined as the angle subtended at the target by 
the detector in a given direction (Figure 13.2). Here, the outcome X is detection of 
the particles after scattering at a great distance from the scatterer into the solid angle 
dfl in a direction specified by spherical coordinates 6, ip that are related to the 
direction of incidence, chosen as the polar axis. If the number of scattered particles 
per unit solid angle is denoted by 1(6, ip), the relation (13.4) becomes the expression 
for the differential cross section, or angular distribution. 


da _ 1(6, <p) 
dfl ntAI 0 


(13.5) 


This is the quantity which the experimentalist delivers to the theoretician, who in- 
terprets the cross section in terms of probabilities calculated from wave functions. 

The outcome X of a collision experiment is defined by the physical arrangement 
of the measurement. The initial collision parameters, including the relative velocity 
or kinetic energy of the incident particles, the intensity of the incident beam, and 
the velocity (if different from zero) of the target atoms, are usually determined and 
quoted with respect to a laboratory frame-of-reference. Yet, the theoretical analysis 
is often carried out in an entirely different frame of reference, such as the center- 
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Figure 13.2. Scattering of a wave packet incident from the left with mean momentum ftk 0 . 
In elastic scattering, the wave packet is scattered with equal energy but varying amplitudes 
in different directions (indicated by the thickness of the arrows). In the forward direction, 
the scattered wave interferes with the advancing incident wave. 


of-mass frame in which the total momentum of the system is zero, or with the use 
of relative coordinates appropriate to the interaction between two particles. The cross 
section depends on the collision parameters (incident energy, polarization, etc.) and 
the particular variables (scattering angle, energy loss, etc.) which quantify the out- 
come X. If these variables are denoted by x in the theoretically preferred frame-of- 
reference, a transformation to the corresponding variables, x', in the laboratory 
frame must be carried out. Formally, the differential cross-sections in the two frames 
are related by 

dcr x dcr x dx _ 

<136) 

This relation cautions us that the two differential cross sections do not have the same 
value. A numerical example at the end of this section illustrates this for the angular 
distribution. 

We now turn to the theoretical calculation of a scattering cross section. In an 
idealized scattering process, a single fixed scattering center or target particle is bom- 
barded by particles incident along the z axis. It is assumed that the effect of the 
scattering center on the particles can be represented by a potential energy V(r) which 
is appreciably different from zero only within a finite region. Although this as- 
sumption excludes as common a long-range force as the Coulomb field, represented 
by a potential proportional to l/r and observable as Rutherford scattering, this lim- 
itation is not severe. In actual fact, the Coulomb field is screened at large distances 
by the presence of electrons in atoms and by other particles, and for large r the 
potential falls off faster than l/r. 

In the classical limit, each incident particle can be assigned an impact param- 
eter, i.e., a distance b from the z axis, parallel to which it approaches from infinity, 
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X 



Figure 13.3. Scattering through an angle 6 of a particle beam incident at impact parameter 
b and azimuth angle < p. Classically, the cylinder bounded by b . . b + db and <p . . <p + d<p 
defines the cross section area da. 


and an azimuth angle <p, which together with z define its position in cylindrical 
coordinates. For particles moving on classical orbits, the final asymptotic direction 
9, <p is determined by these initial coordinates and by the incident energy. The cross 
section da is simply equal to the size of the area which, when placed at right angles 
to the incident beam, is traversed by all those orbits that asymptotically end up in 
the solid angle dCL around the direction 9, <p. Figure 13.3 illustrates the point. Ob- 
viously, this discussion must be refined if the scattering angle is not a continuous 
smooth function of the impact parameter or of the azimuth angle of the incident 
particle, but we ignore such singular situations (e.g., rainbow scattering). 

If the scattering potential is spherically symmetric, V — V(r), the orbits lie in 
planes through the center of force, and the scattering becomes independent of the 
azimuth angle <p. Therefore, classically, 


da — b db dip = b 


db 

d cos 6 


| d cos d\ d<p = b 


db 

d cos 9 


d£l 


(13.7) 


from which we see that 



db 

b 

db 

d cos 9 

sin 9 

d9 


(13.8) 


Hence, the differential cross section is calculable if, for the given energy, we know 
b as a function of the scattering angle 9. To determine this function from the equation 
of motion is the problem of classical scattering theory. 


Exercise 13.1 In Rutherford scattering of a particle of mass m and incident 
energy E, with potential energy V(r) = C/r (C constant), the functional relationship 
between the impact parameter b and the scattering angle 9 is given by 
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Derive the differential scattering cross section for Rutherford scattering: 

da _ Z\Z\e A 1 
dil ~ 16 E 2 sin 4 (0/2) 


(13.9) 


where Z x e and Z 2 e are the nuclear charges of the projectile and target particles, 
respectively. 


Equation (13.8) is a good approximation only if the de Broglie wavelength of 
the incident particles is much smaller than the dimensions of the scattering region. 
It ceases to be useful when the wave description can no longer be adequately ap- 
proximated by the geometrical optics limit. As the wavelength increases, quantum 
features make their appearance, and the quantum uncertainty begins to limit the 
simultaneous knowledge of, say, p x , which for a well-collimated beam should be 
close to zero, and *, which is proportional to the impact parameter. Thus we must 
learn how to calculate cross sections from wave functions. 

The most significant quantum mechanical aspect of scattering is that we are 
now concerned with unbound particles and the continuous part of the energy spec- 
trum. The Schrodinger equation for unbound eigenstates must now be solved, cor- 
responding to positive energy eigenvalues, and the connection between eigenfunc- 
tions and measured intensities must be established. 

Quantum mechanics represents the particles in the beam by wave packets of 
various shapes and sizes. We would like to suppose that all the particles in the beam 
with incident intensity I 0 can be represented by very broad and very long wave 
packets propagating in the z direction and that, before they reach the neighborhood 
of the scatterer, these packets can be described simply and approximately by infinite 
plane waves e i(kz ~ wt \ although strictly speaking the waves do not extend to infinity 
either in width or in length. After scattering, the particles are detected at a great 
distance from the scatterer, and we would like to describe them again, asymptoti- 
cally, by simple, radially outgoing harmonic waves, but with direction-dependent 
amplitudes (Figure 13.2). 

Only comparison between theory and experiment can tell us whether our as- 
sumption that all the particles in the beam can be represented by very broad and 
long wave packets truly reflects the properties of a real beam, consisting, as it does, 
of particles that were emitted from a source, that have perhaps undergone some 
acceleration, and that may have been selected as to energy and momentum by ana- 
lyzers and slits. Fortunately, the mathematical analysis of the next section will show 
the cross section to be in general independent of the peculiarities of the incident 
wave packet, provided only that the packet is large compared with the scattering 
region toward which it is directed. 

We conclude this section with a concrete illustrative example. Consider the 
elastic Rutherford scattering of 10-MeV alpha particles from an aluminum foil of 
t = 1 p m thickness. We will see in Section 13.8 that quantum mechanics gives 
essentially the same results for this force law as classical mechanics, allowing us to 
employ the differential cross section (13.9) for scattering of alpha particles. Since 
the atomic mass of aluminum is about 27 and its density is 2.7 g cm -3 , the number 
of A1 atoms per cm 3 is n = 6 X 10 22 cm -3 . The number of atoms per cm 2 of area 
is thus nt = 6 X 10 18 cm -2 . The reduced mass of the ct-Al system is 4 X 27/31 = 
3.48 u (instead of approximately 4 u for an a particle colliding with an infinitely 
massive nucleus). Hence, the energy of the relative motion is 3.48/4 times the in- 
cident energy, or 8.7 MeV. Given that the charges of the colliding nuclei are 2e (for 
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m P 

V G ~ m P + m T v 0 



Figure 13.4. Kinematics of elastic scattering in the center-of-mass and laboratory 
coordinate frames. In the laboratory frame, the target particle is initially at rest, the 
projectile has velocity v 0 , and the center of mass has constant velocity v G . In the center- 
of-mass frame \ P and \ T are the velocities of the scattered projectile and the recoiling 
target particle, respectively. In the laboratory frame 9' and 0' R are the scattering and recoil 
angles, respectively, and \' P and y' T are the final laboratory velocities. A mass ratio 
m p /m T — 3/5 has been assumed in this diagram. 


He) and 13e (for Al), their classical distance of closest approach in a head-on col- 
lision is calculated from conservation of energy as 4.3 X 10 -15 m or 4.3 fm. 

For elastic collisions, the kinematic relation between the scattering angle 6 in 
the center-of-mass or relative coordinate frame and the scattering angle 6' in the 
laboratory frame-of-reference is 


tan d' 


sin 9 


cos d + 


m P 

m T 


(13.10) 


where m P is the mass of the incident projectile and m T is the mass of the initially 
stationary target (Figure 13.4). 

For example, corresponding to 9' = 30° in the laboratory frame, the scattering 
angle in the center-of-mass system (as well as in the relative coordinate frame) is 
9 = 34.2°. The distance of closest approach corresponding to this scattering angle 
is 9.5 fm, which is close to the nuclear radius of aluminum, so that small departures 
from Rutherford scattering may occur. We then calculate the Rutherford differential 
cross section in the center-of-mass frame for this scattering angle as 


J 

— — 1.53 X 10 -28 m 2 = 1.53 X 10“ 24 cm 2 = 1.53 bams 

ui 2 


This is the differential cross section per steradian. At this scattering angle one finds 
that 
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and, for the differential cross section in the laboratory frame, 


da 

diV 


da sin 0 
dfL sin O' 


= 1.95 bams = 1.95 X 10 24 cm 2 


If the solid angle subtended by the detector is dCt' — 10 -6 steradian, the calculated 
cross section for this outcome, is da = 1.95 X 10 -30 cm 2 , and the probability that 
an a particle is scattered into the detector with ideal detection efficiency is nt da = 
1.17 X 10 1X . In other words, it is expected that about one in 10 11 of the a particles 
that are incident on the target will be scattered into the detector at this scattering 
angle. If the rate of incident a particles is I 0 = 10 12 sec -1 , the counting rate at the 
detector will be about 12 particles per second. (The luminosity in this experiment 
is I 0 nt = 10 12 X 6 X 10 22 X 10 4 = 6 X 10 38 sec -1 cm -2 .) 

It is important to ask if the energy of a 10-MeV a particle is appreciably de- 
graded as it passes through the 1 micron aluminum foil. Since the stopping power 
of aluminum for protons of the same velocity (2.5 MeV) is about 250 MeV/cm, an 
a particle (charge 2e) loses about 0.1 MeV, or only 1 percent of the original energy, 
in its passage through the foil. Finally, one has to estimate the likelihood of multiple 
Rutherford scattering in the target. For the foil thickness assumed in our example, 
this effect is found to be quite small. 


Exercise 13.2 . Calculate the probability that in the example of Rutherford 
scattering just given an alpha particle is scattered through an angle greater than 10 -2 
radian in the laboratory frame. Use the result to estimate the importance of multiple 
scattering in this example. 

2. The Scattering of a Wave Packet. Let us assume that the motion of the pro- 
jectiles in the field of a single scatterer ( N T = 1) is described by a Hamiltonian 

P 2 

H = Z~ +V=T+ V=H 0 + V (13.11) 

2m 

where V is appreciably different from zero only within 'a sphere of radius a sur- 
rounding the coordinate origin. At t = 0 a particle is represented by a wave packet 
of the general form 

i//(r, 0) = N J cf>(k) exp[ik • (r - r 0 )] d 3 k (13.12) 

where <f>( k) is a smooth function of narrow width, Ak, centered around a mean 
momentum hk 0 . The normalization constant N is arbitrary. The initial state i[/(r, 0), 
as expressed by (13.12), is thus an extended wave packet located in the vicinity of 
r 0 , but with fairly sharply defined momentum. We assume that k 0 is parallel to r 0 , 
but in an opposite direction, so that the wave packet, if unhindered in its motion, 
would move freely toward the coordinate origin; and we assume r 0 to be so large 
that if/ at t = 0 lies in its entirety well to the left of the scatterer (Figure 13.2). 

The dynamical problem to be solved is as follows: What is the shape of the 
wave packet (13.12) at a much later time, when the packet has encountered the 
scatterer and been eventually dispersed by it? In principle, the answer can be given 
easily, if we succeed in expanding i/r( r, 0) in terms of eigenfunctions, ij/ n (r), of H. 
Indeed, if we can establish an expansion 

r, 0) = 2 c n ift„(r) 

nl 
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then the wave packet at time t is 

<A(r, 0 = 2 c„^„(r) exp ~~ E n t 

n 1 \ ft 

How can this program be applied to a wave packet of the form (13.12)? True, it is 
an expansion in terms of a complete orthonormal set, the plane waves e‘ k r , but these 
are eigenfunctions of the operator H 0 , and not of H. However, it will be proved that 
it is possible to replace the plane wave functions in the expansion by particular 
eigenfunctions of H, which we will designate as i/4 +) (r), if <A(r> 0) is a wave packet 
of the kind described. Asymptotically, the eigenfunctions t/4 +) (r) bear a considerable 
resemblance to plane waves, since they are of the form 



(13.13) 


This differs from a plane wave at large r only by an outgoing spherical wave. As 
we will see in Section 13.3, such solutions of the Schrodinger equation 

tf</4 +) (r) = £</4 +) ( r) (13.14) 

do indeed exist. In order not to interrupt the argument, we assume their existence 
in this section. It will also be shown in Section 13.3 that the special wave packet 
i/r(r, 0) has the same expansion coefficients, whether it is expanded in terms of plane 
waves, as in (13.12), or in terms of the set i/4 +) (r): 


<K r, 0) = J 4>( k)<T' kr °(/4 +) (r) d 3 k (13.15) 

In other words, the outgoing wave in (13.13) asymptotically makes no contribution 
in the initial wave packet. The replacement of (13.12) by (13.15) is the critical step 
in this analysis. 2 

Once the initial wave function i/r(r, 0) has been expanded in terms of eigen- 
functions of H, we can write at any time t, 

t/t(r, t) = j <t>(k)e- ikr °- ia>t ifr ( k +) (r) d 3 k (13.16) 


where 


hoj 


tfk 2 

2m 


is the energy eigenvalue for the eigenfunction i/4 +) (r). 

Assuming that the scattering detectors are at a macroscopic distance of the order 
of r 0 from the scatterer, after a time 


T = 


2 mr 0 
hk 0 


(13.17) 


2 The discussion of the time development of wave packets in scattering follows lecture notes by 

F. E. Low. 
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the broad pulse will be traveling through the position of the detectors (see Figure 
13.2). When we examine the pulse at the position of the detector, i/4 +) (r) can again 
be represented by its asymptotic expansion, but since the phases have changed with 
time the outgoing wave may no longer be neglected. However, as in (2.29), we can 
make an expansion about k^\ 

e = e 0 + 2(k - k 0 ) • k 0 + (k - k 0 ) 2 (13.18) 


In order to be able to neglect the last term, when k 2 is substituted at t = T into the 
expression 


hie 2 

toT = T 

2m 


in Eq. (13.16), we require that T, though large, should still satisfy the inequality 


h 

2m 


(k - k 0 ) 2 T « 1 


or, using (13.17), 


(A k) 2 r 0 
ko 


« 1 


This condition, familiar from Section 2.4, implies that the wave packet spreads neg- 
ligibly when it is displaced by the macroscopic distance r 0 . 


Exercise 13.3. Show, by numerical example, that in scattering of particles 
from atoms and nuclei the condition of no spreading can be easily attained by a 
minimum uncertainty wave packet (AkAr ~ 1), which is large compared with the 
scattering region but small compared with r 0 . 


Approximately and asymptotically, we can thus write (13.16) as 


where 


r 

j/Kr, t) = <£(k) exp[-/k • (r 0 + v 0 r) + 


ico 0 t] 


ikr\ 


X at! + / k (?) — 1 d 5 k 


& k 0 , 1 2 

v 0 = , no) 0 = ~ mvo 

m 2 


(13.19) 


Comparing (13.19) with (13.12), and assuming that / k (r), unlike <f)( k), is a slowly 
varying function of k, we can write 


t) = i/r(r - v 0 t, 0)e i( ° ot + N 


r 


<fr(k) exp [i[kr - k • (r 0 + v 0 ?)]} d 3 k 


with carets denoting unit vectors. Since for a well-collimated beam, <£(k) is appre- 
ciably different from zero only for k ~ k 0 , we can write in the exponent effectively, 

kr ~ k • k 0 r 


(13.20) 
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and consequently, again by comparison with (13.12), asymptotically, 

iKr, 0 = iA(r - V, OK'^ + iA(rk 0 - \ 0 t, 0)e iw °‘ (13.21) 

r 

Except for the phase factor e“° ot , the first term on the right-hand side represents the 
initial wave packet displaced without change of shape, as if no scattering had oc- 
curred, i.e., as if V were absent from the Hamiltonian. 

A 

The factor ^(rk 0 — v 0 f, 0) in the second term of (13.21) can be expressed as 
ip([r - rV]ko> 0)- It differs from zero only when r - v 0 t ~r Q , within a range A r 
that is the same as the spatial width of the initial wave packet, and represents a 
radially expanding image of the incident wave packet. A detector placed at distance 
r is reached by this wave packet at time t = (r 0 + r)tv 0 , as expected for uniform 
particle motion. Thus, the second term in (13.21) is a scattered spherical wave 
packet, reduced in amplitude by the factor 1/r and modulated by the angular ampli- 
tude / ko (r). Sensibly, the latter is called the scattering amplitude. 

The assumption, made in deriving (13.21), of a slow variation of the scattering 
amplitude with k, excludes from this treatment those scattering resonances which 
are characterized by an exceptionally rapid change of the scattering amplitude with 
the energy of the particle. When a resonance is so sharp that the scattering amplitude 
changes appreciably even over the narrowest tolerable momentum range A k, the 
scattered wave packet may have a shape that is considerably distorted from the 
incident wave packet, because near such a resonance different momentum compo- 
nents scatter very differently. There is then strong dispersion. (See Section 13.6 for 
some discussion of resonances.) 

The probability of observing the particle at the detector in the time interval 
between t and t + dt is 

y 0 — — y-— 1 1 J/([r — u 0 r]k 0 , 0) | 2 r 2 d£l dt 
r 

Hence, the total probability for detecting it at any time is 

r + oo r+ oo 

u 0 |/ko(r)| 2 dil | <K[r - Mko, 0)| 2 dt = j/^f)! 2 da \ k 0 , 0)| 2 d$ 

J — oo J — oo 

(13.22) 

The limits of integration in (13.22) may be taken to be with impunity, since 

t) describes a wave packet of finite length. The probability that the incident 
particle will pass through a unit area located perpendicular to the beam in front of 
the scatterer is 

J + J |<K£ko, 0 )| 2 d£ 

There is no harm in extending this integration from — °° to +°° either, since at 
t = 0 the wave packet is entirely in front of the scatterer. If the ensemble of pro- 
jectiles contains N P particles, all represented by the same general type of wave 
packet if/i (i = 1, . . . N P ), the number of particles scattered into the solid angle da 
is 

/(f) da = l/yf)! 2 da 2 f | ^ 0 , o )| 2 ^ 

i=i /-“> 
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whereas 

N p /*+ oo 

Io = 2 |«A,'(^ko,0 )| 2 d£ 

i=i J - “ 

gives the number of particles incident per unit area. Hence, by (13.5) and setting 
N t = ntA - 1 for a single scattering center, we arrive at the fundamental result: 

r 

(13.23) 

establishing the fundamental connection between the scattering amplitude and the 
differential cross section. This result is independent of the detailed behavior of the 
projectile wave packets and the normalization constant N. 

3. Green’s Functions in Scattering Theory. To complete the discussion, it is 
necessary to show that 

(a) eigenfunctions of the asymptotic form (13.13) exist, and 

(b) the expansion (13.15) is correct. 

These two problems are not unconnected, and (b) will find its answer after we 
have constructed the solutions i/4 +) (r). The method of the Green’s function by which 
this may be accomplished is far more general than the immediate problem would 
suggest. 

The Schrodingcr equation to be solved is 

[~h v2 + = E * 

or 

(V 2 + k 2 )if/ = Uif/ ‘ (13.24) 

where k 2 = 2mE/h 2 and U = 2 mV/h 2 . It is useful to replace the differential equa- 
tion (13.24) by an integral equation. The transformation to an integral equation is 
performed most efficiently by regarding Uif/ on the right-hand side of (13.24) tem- 
porarily as a given inhomogeneity, even though it contains the unknown function 
if/. Formally, then, a particular “solution” of (13.24) is conveniently constructed in 
terms of the Green’s function G(r, r'), which is a solution of the equation 

(V 2 + e)G{ r, r') = -47rS(r - r') (13.25) 

Indeed, the expression 

f G(r, r')U(r')if/(r') d 2 r' (13.26) 

47 T J 

solves (13.24) by virtue of the properties of the delta function. To this particular 
solution we may add an arbitrary solution of the homogeneous equation 

(V 2 + k 2 )^ = 0 



(13.27) 
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which is the Schrodinger equation for a free particle (no scattering). Leaving a nor- 
malization factor N undetermined, we thus establish the integral equation 


tp k(r) = Ne lkr 



G(r, r')U(r')tp k (r') d 3 r' 


(13.28) 


for a particular set of solutions of the Schrbdinger equation (13.24). The vector k 
has a definite magnitude, fixed by the energy eigenvalue, but its direction is unde- 
termined. Thus, the solution exhibits an infinite degree of degeneracy, which cor- 
responds physically to the possibility of choosing an arbitrary direction of incidence. 

Even if a particular vector k is selected, (13.28) is by no means completely 
defined yet: The Green’s function could be any solution of (13.25), and there are 
infinitely many different ones. The choice of a particular G(r, r') imposes definite 
boundary conditions on the eigenfunctions </f k (r). Two particularly useful Green’s 
functions are 


G±(r, r') 


exp(±tfc|r - r'j) 


(13.29) 


Exercise 13.4. Verify that the Green’s functions (13.29) are solutions of the 
inhomogeneous equation (13.25). 


(13.30) 


A host of Green’s functions of the form 

G(r, r') = G(r - r') 
may be obtained by applying a Fourier transformation to the equation 

(V 2 + e)G{ r) = -4rrS(r) (13.31) 

which is a simplified version of (13.25). If we introduce the Fourier integral 


G(r) = J g ( k V k ' r d 3 k’ 


and the Fourier representation of the delta function, we obtain by substitution into 
(13.31) 


1 


1 


g(k,) 2 t t 2 k' 2 - k 2 

hence, the Fourier representation 


G(r) 


— f — i P ikt d 3 k' 

i 2 7T 2 k ,2 -e 


(13.32) 


Integrating over the angles, we obtain, after a little algebra, the convenient form 


G{r) 




Jk'r 


k >2 - e 


k' dk' 


(13.33) 


Since the integrand has simple poles on the real axis in the complex k '-plane, at 
k' = ±k, the integral in (13.33) does not really exist. This suggests that our attempt 
to represent the solutions of (13.25) as Fourier integrals has failed. Nevertheless, 
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this approach has the potential to succeed, because the integral in (13.33) can be 
replaced by another one that does exist, thus 


G +V (r ) 


-f 


Ak'r 


k' 2 — (k 2 + iri) 


k' dk' 


(13.34) 


where 77 is a small positive number. So defined, G +V (r) exists but is no longer a 
solution to (13.31). The trick is to evaluate the expression (13.34) for 17 + 0 and to 
let 7j -*• 0, i.e., G +V (r ) — > G + (r), after the integration has been performed. 

Alternatively, we might say that a unique solution of (13.31) does not exist, for 
if it did G would be the inverse of the operator — (l/47r)(V 2 + k 2 ). But this operator 
has no inverse, because the homogeneous equation (V 2 + k 2 ) t]/ = 0 does have non- 
trivial solutions. However, the inverse of — (l/47r)(V 2 + k 2 + irj ) exists and is the 
Green’s function G +V (r ) in (13.34). (See also Section 18.2.) 

The integral 



-ik'r 


r > 2 


(k 2 + irj) 


k' 


dk' 


is most easily computed by using the complex plane as an auxiliary device (see 
Figure 13.5). The poles of the integrand are at 


k' 


= ±Vk 2 + irj ~ 




for small 17. The path of integration leads along the real axis from — 00 to + 00 . 

Since r is necessarily positive, a closed contour may be used if we complete 
the path by a semicircle of very large radius through the upper half k' -plane. It 
sncloses the pole in the right half plane. The result of the integration is not altered 
by introducing detours avoiding the two points k' — +k and k' = —k, as indicated 


Im k' 
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in the figure. If this is done, the limit 17 — > 0 can be taken prior to the integration, 
and we may write 

G + (r) = lim G +V (r ) - — <fi — f _ k' dk' (13.35) 

^0 nrr J k ,z - Jr 

which by the use of the residue at k' = k becomes 



(13.36) 


If we replace tj in (13.34) by —17(17 > 0), we obtain a second Green’s function 

-ikr 

G-(r) = lim G-Jr) = (13.37) 

19 — >0 r 

in agreement with (13.29). Still other Green’s functions can be formed by treating 
the singularities that appear in (13.33) differently. 

Exercise 13.5. Show that the Green’s function 

Gi(r) = ]- G+(r) + G_(r) = (13.38) 

2 2 r 

is obtained if the integral in (13.33) is replaced by its (Cauchy) principal value. 


The Green’s functions (13.36), (13.37), and (13.38) may be identified as out- 
going, incoming, and standing waves. To appreciate this terminology, we need only 
multiply them by the time factor exp (—iEtJh). Furthermore, the description of scat- 
tering in terms of wave packets suggests the designation retarded for G+ and ad- 
vanced for G_. 

When the special forms (13.36) or (13.37) are substituted in (13.28), two dis- 
tinct eigensolutions result, denoted by 1 f/ (+) and They satisfy the integral equa- 
tion, 


</4 ±} (r) = Ne ikT 


1 f exp(±z£jr — r' j) 
47T J |r — r' 


U( r'WSfV) d 3 r' 


(13.39) 


which is the Schrodinger equation rewritten in a form that is particularly convenient 
for use in scattering theory. We must show that in the asymptotic limit, as r —> 00 , 
the right-hand side of equation (13.39) assumes the simple form (13.13). 

For large r the integrand can be closely approximated in view of the fact that 
U + 0 only for values of r' < a. In the exponent, we expand in powers of r': 

Jc(v X 1* f 

k\v - r 7 | = kVr 2 - 2r • r' + r' 2 = kr - kr ■ r' + f • * • 

2r 

If r is chosen so large that 

ka 2 lr « 1 

then the quadratic term in the exponent can be neglected. If, furthermore, r' in the 
denominator of the integrand in (13.39) is neglected, we obtain for large r: 

±ikr 

rt*>(r) a Ne‘^ - — 

4ir r 


* 


Fik '- r 'G( r'liAirV) dV 


(13.40) 
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where we have set 


k' = kr 


(13.41) 


The asymptotic expression (13.40) can be written as 

*ikr \ 


e~ 


tirKr) = N\ e ikr + f?\r) 


(r large) 


where 



-7-7. [ ^‘ k ' r 'f/(r')</ 4 ±) (r') d 3 r’ 
4irN J 

f e :F ' k, ' r, V , (r')iAk ±) (r') d 3 r' 


(13.42) 


(13.43) 


Equation (13.42) shows why i// (+) and when supplemented by exp( — iEt/h), are 
called the spherically outgoing and incoming solutions of the Schrodinger equation: 
They satisfy the appropriate boundary conditions at infinity. The outgoing solution 
is indeed asymptotically of the form (13.13), thus verifying assertion (a) at the 
beginning of this section. It is customary to omit the superscript symbol (+) qual- 
ifying the outgoing scattering amplitude and to write / k (r) for / k +) (r). 

To prove assertion (b) we must employ the exact form (13.39) and demonstrate 
that for the initial wave packet 

J </>(k) ,exp(— zk • r 0 ) d 3 k J CXP j^ r> ^ ^ t/(r')«/4 +) (r') d 3 r' = 0 
Since U(r') = 0 for r' > a, it is sufficient to show that for r' < a 

J <p(k) exp(— ik • r 0 + ik\r — r' |)»/4 +) (r') d 3 k — 0 (13.44) 

i 

It may usually be assumed that in this integral the variation of i/4 +) (r') with k can 
be neglected. It can be seen from the integral equation for i/4 +) that this is true if 
the width of the wave packet A r » a and if we are not at an inordinately narrow 
resonance for which the scattering amplitude varies extremely rapidly with k. Most 
physical situations meet these conditions. 

Again we note that the wave packet <fi (k) is appreciably different from zero 
only for vectors k near the direction of k 0 , so that we may approximate as in (13.20), 

k\r - r'| « k 0 • k|r - r'| 

Hence, the left-hand side of (13.44) is nearly equal to 

J <t>( k) exp[/k • (k 0 |r - r' | - r 0 )] d 3 k = A^ -1 i/r(k 0 |r - r'|,0) ^ } (r') 

The right-hand side of this equation vanishes, because the vector ko |r — r' | points 
to a position behind the scatterer where the wave packet was assumed to vanish at 
t — 0. Hence, assertion (b) is proved, completing the discussion begun in Section 
13.2. 

Although the value of N does not affect the results, we may choose 
N = (27 r)~ 3/2 , corresponding to k-normalization. In Chapter 20 it will be shown 
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that, with this normalization, the energy eigenfunctions i/4 +) (r) are orthonormal, 
since 


J <Ak +> (r)*'Ak t) (r) = «(k - k') (13.45) 

When supplemented by the bound-state energy eigenfunctions, they make up a com- 
plete set of orthonormal functions. 

Exercise 13.6. If the potential V is real-valued, prove that <pl +> ( r ) and </' < -k( r ) 
are mutually time-reversed scattering- state solutions of the Schrodinger equation. 
Interpret this result. 

Exercise 13. 7. If the scattering potential has the translation invariance prop- 
erty V(r + R) = V(r), where R is a constant vector, (a) prove that the scattering 
solutions I/4" 0 of the integral form of the Schrodinger equation are Bloch wave 
functions, since they satisfy the relation 

4 ±} (r + R) — e !kR </4 ±) (r) 

and (b) show that the scattering amplitude vanishes unless q = k — k' is a reciprocal 
lattice vector (Exercise 4.14) which satisfies the condition 

q • R = (k - k') • R = Irrn (13.46) 

where n is an integer. This relation is the Laue condition familiar in condensed- 
matter physics. 3 


4 . The Born Approximation. Before we proceed, let us summarize the results of 
the preceding sections. If particles with an average momentum hk are incident upon 
a scatterer represented by the potential V(r), the differential cross section is given 
by (13.23), 


da 

dVt 


i/ k (r)| 2 


where the scattering amplitude f k (r) is defined as the coefficient of the outgoing 
wave in the asymptotic solution 


Jkr 


*Mc +) (r) = M e ‘ k r + — /k(r) (r large) 


of the Schrodinger equation 


(V 2 + k 2 )^ = ^ Vifj 


The scattering amplitude for elastic scattering in the direction k' is given by 
the formula 


Mk') = «-“’-'’V(r'W4 +, (r') £r 


(13.47) 


3 See Christman (1988), Section 4.2. 
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but since ^ (+) appears in the integrand, this is not an explicit expression. However, 
it can be used to obtain an estimate of the scattering amplitude if we replace the 
exact eigenfunction i/f (+) in the integrand by the normalized plane wave term on the 
right-hand side of Eq. (13.40), neglecting the scattered wave. In this approximation 
we obtain 


/k(k) - - 


m 


lirh* 




— ik'-r' 


V(r')e ikr ' d 3 r' 


(13.48) 


which is known as the scattering amplitude in the (first) Born approximation. Here 
the scattering amplitude appears proportional to the matrix element of the scattering 
potential between the plane waves, e lkT and e ,k r , representing the free particle 
before and after the scattering. It should be evident that the Bora approximation 
corresponds to a first iteration of (13.40), where the plane wave is substituted for 
i// (+) under the integral. The iteration may be continued to obtain higher order Born 
ipproximations. (See Section 20.3.) 

For a central-force potential, V(r), the Born scattering amplitude (13.48) re- 
luces to 


where 




m f 

27 th 2 . 


V(r’)e~ i(ir ' cPr' 


(13.49) 


q = k' - k 

s known as the momentum transfer (in units of h). As the Fourier transform of the 
cotential, the Born amplitude (13.49) resembles the field amplitude in Fraunhofer 
iiffraction. Figure 13.6 is a primitive Feynman diagram, depicting the first Born 
ipproximation scattering amplitude (13.48). 



Figure 13.6. Diagram representing the first Born approximation matrix element, (k' | V|k), 
)f the potential between eigenstates of incident momentum k and scattered momentum 
i' = k + q (in units of h). The potential V causes the momentum transfer q. The 
icattering angle is 8. 
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The integral over the solid angle in (13.49) is easily carried out and yields the 
result 


fsomW) 


_2m r 
h 2 Jo 


V(r') 


sin qr ,, 

r 

qr 


dr' 


(13.50) 


Here we denote scattering angle between k and k' by 0, and note that k' — k for 
elastic scattering, so that 


q 


Q 

2k sin - 
2 


(13.51) 


as seen in Figure 13.6. 

As an application, consider the screened Coulomb (or Yukawa) potential 

V(r) = V 0 e — (13.52) 

ar 


where the length 1 /a may be considered as the range of the potential. In the Born 
approximation, after a simple integration over angles, we find 


f Bom(^) — 




e ar ' sin qr' 
ar' qr' 


.'2 


dr' 


2m 1 _ 2mV 0 1 

h 2 a 0 q 2 + a 2 h 2 a 4 k 2 sin 2 (0/2) + a 2 


(13.53) 


The differential scattering cross section is obtained by taking the square of the am- 
plitude (13.53). 

The unscreened Coulomb potential between two charges q l and q 2 is a limiting 
case of the potential (13.52) for a — » 0 and V 0 — > 0 with V 0 /a = q^q 2 . Hence, in 
the Born approximation, 

dcr _ m 2 q 2 ql = q 2 ql 
da 4 p 4 sin 4 (0/2) 16 E 2 sin 4 (0/2) 


This result, even though obtained by an approximation method, is in exact agreement 
with both the classical Rutherford cross sections (13.9) and the exact quantum me- 
chanical evaluation of the Coulomb scattering cross section — one of the many re- 
markable coincidences peculiar to the pure Coulomb potential. Note, however, that 
the exact Coulomb scattering amplitude differs from the Born amplitude by a phase 
factor (see Section 13.8). 


Exercise 13.8. Calculate the total scattering cross section for the screened 
Coulomb potential (13.52) in the Born approximation and discuss the accuracy of 
this result. 


Exercise 13.9. Apply the Born approximation to the scattering from a square 
well. Evaluate and plot the differential and total scattering cross sections. 

Exercise 13.10. Obtain the differential scattering cross section in the Born 
approximation for the potential 

-r/a 


V(r) = -V 0 e 


(V 0 > 0) 
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Exercise 13.11. If V = Clr n , obtain the functional dependence of the Born 
cattering amplitude on the scattering angle. Discuss the reasonableness of the result 
[ualitatively. What values of n give a meaningful answer? 

A reliable estimate of the accuracy of the Born approximation is in general not 
>asy to obtain, since the term that is neglected is itself an integral that depends on 
he potential, the wave function in the region where the potential does not vanish, 
nd particularly on the momentum of the scattered particle. Qualitatively, it is easy 
o see that the Born approximation is a form of perturbation theory, in which the 
:inetic energy operator is the unperturbed Hamiltonian and the potential is the per- 
urbation. The approximation is thus likely to be valid for weak potentials and high 
nergies. 

The Born approximation affords a rapid estimate of scattering cross sections 
nd is valid for reasonably high energies in comparison with the interaction energy, 
because of its simplicity, it has enjoyed great popularity in atomic and nuclear 
•hysics. Its usefulness does not, however, vitiate the need for an exact method of 
alculating scattering cross sections. To this task we must now attend. 


I. Partial Waves and Phase Shifts. Let us assume that V is a central potential, 
t is to be expected that for a spherically symmetric potential the solution (13.39), 
epresenting an incident and a scattered wave, should exhibit cylindrical symmetry 
bout the direction of incidence. Indeed, we can also see formally from (13.39) that 
>k +) (r) depends on k and r and on the angle 6 between k and r only, if V is a function 
'f r alone. Hence, we may, without loss of generality, assume that k points in the 
ositive z direction and that for a given value of k, </4 +) is a function of r and the 
cattering angle 6. 


Exercise 13.12. Show that for a central potential (/4 +) is an eigenfunction of 
tie component of L in the direction of k, with eigenvalue zero, and discuss the 
ignificance of this fact for the scattering of a wave packet. 


We must thus look for solutions of the Schrodinger equation 

h 2 , h 2 !^ 

V 2 *A + V(r)tJj = — - 
2m 2m 


/hich have the asymptotic form 


</4 +) = N[ e 


ikr cos S 


ikr\ 

+ fkiO) — 
r i 


(13.54) 


t is desirable to establish the connection between these solutions and the separable 
olutions (12.3) of the central-force problem, 


Re.k(r)Y?(e, <p) = Y?(9, <p) 


(13.55) 


/hich are com m on eigenfunctions of H , L 2 , and L z . The radial functions R ( ,k( r ) and 
itk (r) satisfy the differential equations 



r n 


1 Re., 


e R^r) - 0 


(13.56) 
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and 



€(€ + 1) 


2m rj 

+ ^ V(r) - k 2 


u e , k ( r ) = 0 


(13.57) 


respectively, as well as a boundary condition at the origin. In general, this boundary 
condition depends on the shape of V, but, as we saw in Section 12.4, in most practical 
cases it reduces to the requirement that the wave function R^ k (r) be finite at the 
origin, from which it follows that 


u e , k ( 0) = 0 (13.58) 

We will restrict ourselves to potentials that are in accord with this boundary con- 
dition. The value of u' (ik { 0) implies a choice of the normalization of w €ji (r). 

The radial equation for the external region r > a, where the scattering potential 
vanishes, is identical with Eq. (12.9) which was solved in the last chapter. The 
general solution of this equation is a linear combination of the regular and irregular 
solutions and has the form 


ReA r ) = j e (kr ) + B e n,(kr) r> a 


(13.59) 


Using the asymptotic approximations (12.20) and (12.22), we get for large kr 


r , /N u (tk (r) _ a sin (kr — Z tt/2) cos (kr - €ttI2) 

RtAr ) = A e 7 B e r ( kr large) (13.60) 


kr 


kr 


In the complete absence of a scattering potential (V = 0 everywhere), the boundary 
condition at the origin would exclude the irregular solution, and we would have 
B ( = 0 for all values of i. Hence, the magnitude of B e compared with A e is a measure 
of the intensity of scattering. The value B e /A e must be determined by solving the 
Schrodinger equation inside the scattering region (r < a), subject to the boundary 
condition, (13.58), and by joining the interior solution smoothly onto the exterior 
solution (13.59) at r — a. To do this, we must know V explicitly and solve (13.57) 
by numerical methods, if necessary. 

A very useful expression for the cross section can be derived by introducing 
the ratios BJA t as parameters. Since for a real-valued potential u e ^ k {r) may be as- 
sumed to be real, these parameters are real-valued numbers, and we may set 


— = —tan 8 t (13.61) 

A e 

where 5^ is a real angle that vanishes for all i if V = 0 everywhere. The name 
scattering phase shift is thus appropriate for 8 f; , particularly if we note that (13.60) 
can now be written as 


u ek {r ) sin (kr — fir/2 + 8,) 

- - --- = e - (13.62) 

r kr 

The phase shift 5^ measures the amount by which the phase of the radial wave 
function for angular momentum £ differs from the no-scattering case (8 e = 0). Each 
phase shift is, of course, a function of the energy, or of k. 
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Exercise 13.13. Show that (13.62) implies the normalization 
r 

J o u eJc (r)u er (r) dr = ^ 8(k - k ') (13.63) 

'or the radial eigenfunctions. (Hint: Use the radial Schrodinger equation and inte- 
gration by parts.) 


In order to express the differential scattering cross section, or the scattering 
implitude, through the phase shifts we must expand i/4 +) (r, 9) in terms of the sep- 
arable solutions of the form (13.55), which (except for any bound states) are assumed 
o constitute a complete set of orthonormal eigenfunctions. Thus, we set 


... A u (k (r) A sin (kr — ZttII + 8 , ) 

<H + V, 0) = 2 C e (k)P ( ( cos 9) 2 C t (k)P t ( cos 9 ) 

(13.64) 


e—0 


e=o 


vhere use has been made of the fact that t/4 +) (r, 9) depends on the angle 9 between 
i and r and not on the directions of each of these vectors separately (see Exercise 
13.12). 

The expansion coefficients C ( (k) can be determined by comparing the two as- 
ymptotic expressions of the wave function, (13.54) and (13.64). We make use of the 
isymptotic expansion of the plane wave, (12.37), with which (13.54) can be written 
is 


—ikr °=> 


</4 +) = m-—J l (2e + i)(-i) f p € (cos 9) 

|_ 2 ikr (=o 

e ikr ^ e ikr 

+ S (2* + l)P € (cos 9) + f k (9) — 
2 ikr e=o r 


(13.65) 


Dn the other hand, (13.64) takes the equivalent form 

— ■ ikr ikr °° * 

¥k +) = ~ e — E C e (k)i e e~ iSt P f (cos 9) + £ C { (k)(-i) e e i8 ‘P e ( cos 9) (13.66) 

2 ikr f=o 2 ikr t = o 

3y comparing the incoming spherical waves in (13.65) and (13.66), we see that the 
sxpansion coefficients must be of the form 

C e = N(2£ + 1 )*V*‘ (13.67) 


substitution of these values into (13.64) gives the asymptotic expression for the 
vave function, 


i/4 +) (r, 9) = N 2 (2€ + 1 )?e iS * ™ ^ € 77-/2 + P((cos ( 13 . 68 ) 

kr 

rhis differs from a plane wave by the presence of the phase shifts and is called a 
iistorted plane wave. Comparing now the coefficients of the outgoing spherical wave 

7 ikr 

— in (13.65) and (13.66), we obtain 


Me) = 2 ( 2 € + 1 ) 


„2 iS, 


~ 1 


P € (cos 0) 


t= 0 


2ik 


(13.69) 
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or 


= 7 2 (2€ + sin S,(fc) P,(cos 0) 

AC f=0 


(13.70) 


This important formula gives the scattering amplitude in terms of the phase 
shifts by making what is known as a partial wave analysis of the scattering ampli- 
tude. If we remember that each term (partial wave) in the sum (13.70) corresponds 
to a definite value of angular momentum i, the formula may be seen in a more 
physical light. If the scattering potential is strongest at the origin and decreases in 
strength as r increases, then we may expect the low angular momentum components, 
which classically correspond to small impact parameters and therefore close colli- 
sions, to scatter more intensely than the high angular momentum components. More 
quantitatively, this semiclassical argument suggests that if the impact parameter 



exceeds the range a of the potential, or when t > ha, no appreciable scattering 
occurs. Thus, if ka » 1, making the classical argument applicable, we expect the 
phase shifts 8 e for € > ka to be vanishingly small. 

But this argument is of a more general nature. For suppose that ka « 1, as is 
the case for scattering at low energies. Then the incident waves are long, and at a 
given instant the phase of the wave changes very little across the scattering region. 
Hence, all spatial sense of direction is lost, and the scattering amplitude must become 
independent of the angle 6. By (13.70) this implies that all phase shifts vanish, 
except for 8 0 (k) corresponding to £ = 0. When this occurs we say that there is only 
S - wave scattering, and this is isotropic, since P 0 (c os 0) = 1. (See Section 13.7 for 
a better estimate of 8 ( .) 

We conclude that the partial wave sum (13.70) is a particularly useful repre- 
sentation of the scattering amplitude, if on physical grounds only a few angular 
momenta are expected to contribute significantly. In fact, if V is not known before- 
hand, we may attempt to determine the phase shifts as functions of k empirically by 
comparing scattering data at various energies with the formula 


£ - 


k 2 


X (2€ + l)e iS * w sin 8 e (k) P e { cos 0) 

f=0 


(13.71) 


By integrating, we obtain the total scattering cross section : 


■/ 


dfl = “ 2 ( 2 ^ + !) sin2 8 e 
ail kr 


(13.72) 


In the differential scattering cross section (13.71), or angular distribution as it 
is often called, contributions from different partial waves (angular momenta) inter- 
fere with each other, because the scattering amplitude is a sum of terms with dif- 
ferent € values. Such a sum is often said to be a “coherent” superposition of dif- 
ferent angular momenta. No such interferences occur in the integrated total cross 
section (13.72), which is therefore said to constitute an “incoherent” sum of partial 
wave contributions, o>. 
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Each angular momentum value contributes at most a partial cross section 


4tt 

(o-f)max = (2€ + 1) 


(13.73) 


o the total scattering cross section. This value is of the same order of magnitude as 
tie maximum classical scattering cross section per unit h of angular momentum, 
ndeed, if we use the estimate b = ilk for the impact parameter, applying (13.7) 
/e obtain for the contribution to the total cross section from a range A i = 1, or 
ib = 1 Ik, 


<r t = 


27rbAb = 2 tt 


L 

e 


: or large i, this agrees with (13.73) except for a factor 4. The difference is due to 
le inevitable presence of diffraction effects for which the wave nature of matter is 
esponsible. 


Exercise 13.14. Show that the scattering amplitude (13.70) and the total cross 
ection (13.72) are related by the identity 


477 

cr = — Im / k (0) 
k 


(13.74) 


'his formula, known as the optical theorem, holds for collisions in general (see 
ection 20.6). 

Finally, we note that the quantity 

S t (k) = e 2iS > (k) (13.75) 

'hich appears in the scattering amplitude (13.69), is the ratio of the coefficients 
mltiplying the outgoing wave, and the incoming' wave, e ~ i{kr ~ f7r/2) , in the 

'ave function (13.68). In Section 20.5 the quantities S { will be identified as eigen- 
alues of the S matrix. 


. Determination of the Phase Shifts and Scattering Resonances. The theoret- 
:al determination of phase shifts requires that we solve the Schrodinger equation 
>r the given potential and obtain the asymptotic form of the solutions. If the interior 
ave function is calculated and joined smoothly onto the exterior solution (13.59) 
: r = a, the phase shifts can be expressed in terms of the logarithmic derivatives 
: the boundary r — a: 


Pe = 



Using (13.59) and (13.61), we easily find the desired relation between the phase 
lift and the logarithmic derivative at r = a: 


Pfk) = ka 


j((ka) cos 8 ( - n'fka) sin 8 ( 


jfka) cos 8 e — n e (ka) sin 8 ( 


(13.76) 
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S t = e 


h ~ m e 
je + in e 


P e — ka 


X 


Je ~ 
je ~ in e 


Pe ~ ka 


j'e + in't 
je + in e 


(13.77) 


where all the spherical cylinder functions (j ( and n t ) and their (primed) derivatives 
are to be evaluated at the argument ka. The first factor 


e 2 ig e — je 

je + in e 


(13.78) 


in the expression (13.77) defines the (real) phase angles which have a simple and 
interesting interpretation. The quantities are seen to be the phase shifts if p ( — > oo ? 
which implies that 7? f (a) = 0. The radial wave functions vanish at r ~ a for all 
values of £ if the potential represents a hard spherical core of radius a, so that the 
wave function cannot penetrate into the interior at all. The angles & are therefore 
referred to as the hard sphere phase shifts. 


Exercise 13.15. Verify directly from (13.59) and (13.61) that (13.78) gives 
the phase shifts for a hard sphere, and plot their dependence on ka for the lowest 
few values of £. 


If we introduce the real parameters A e and s t . 


j' e + in' 

ka = A f + is t 

je + in e 


we obtain the simple relations 


e 2i(S e ~i t ) _ Pe + i s t 

P e ~ A t - is t 


and 


(13.79) 


(13.80) 


e lS{ sin 8 e 



is e 



(13.81) 


which make explicit how the partial wave contributions to the scattering amplitude 
depend on p { . 

From (13.79) and (12.23) we deduce that 


s e — ka 


je n 'e ~ 
j 2 e + n 2 ( 


1 

ka(j 2 e + r$) 


(13.82) 


which shows that s t is positive for all £. 

Let us apply these results now to the special case of the square well of range a 
and depth V 0 . From (12.41), 


Re(r) 


u t (r) 


je(k'r) 


r 
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/here hk' — V2 m(E -t- V 0 ). Hence, 


Pe = k'a l (13.83) 

Je(k a ) 

For t = 0, we find from (12.30) and (12.31) the simple expressions 

£ 0 = — ka, A 0 = — 1, s 0 — ka, /3 0 = k'a cotan k'a — 1 

'he logarithmic derivative /3 0 is a monotonically decreasing function of energy — 
n important property that can be proved more generally for any potential and all 
\ e . Applying these results to (13.81), we see that the 5-wave scattering amplitude 


1 e~ 2ika ( k 

fo = 7 e iS ° sin 8 0 = — — — — — - e ika sin ka 

k k \k cotan k a — ik 


(13.84) 


n the limit E — » 0, k — » 0, this gives the nonvanishing isotropic 5-wave cross section: 


cr 0 — > 47ra 


2 ( tan k^a 


1 for k’a 


2 mV 0 
1 h 2 


(13.85) 


Exercise 13.16. Compute &, A 1? Si, (3\, and for F-wave scattering from a 
quare well and examine their energy dependence. 


Figures 13.7 and 13.8, calculated for a particular square well, illustrate some 
mportant common features of scattering cross sections. The phase shifts 8 0 , 8 U and 
i 2 , which are determined only to within multiples of tt, were normalized so as to 
;o to zero as E — » °°, when the particle is effectively free. At low energies, P waves 
and waves of higher angular momentum) are scattered less than 5 waves, because 
he presence of the centrifugal potential makes it improbable for a particle to be 
ound near the center of force. Generally, the partial cross sections tend to decrease 
vith increasing angular momentum and increasing energy, but the figures also show 
hat the smooth variation of the phase shifts and cross sections is interrupted by a 
Iramatic change in one of the phase shifts and a corresponding pronounced fluctu- 
,tion in the partial cross section. Thus, for the particular values of the parameters 
tn which Figures 13.7 and 13.8 are based, the P-wave phase shift rises rapidly near 
a — 0.7. Since it passes near the value 37r/2 in this energy range, sin 2 becomes 
lose to unity, and the partial cross section approaches its maximum value, 
r, = \2Tr/k z . 

It may happen that in a small energy range a rapid change of the logarithmic 
lerivative /3 e can be represented by a linear approximation, 

Pt(E) = c + bE (13.86) 

vhile the quantities A e , and s t , which characterize the external wave function, 
r ary slowly and smoothly with energy, and may be regarded as constant. If this 
ipproximation is substituted in (13.80), we get 

r 

E - E 0 - i — 

2 i(8 t -Z t ) = ± 

r 

E - E 0 + i- 


(13.87) 
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Figure 13.7 S, P, and D phase sh ifts (S 0 , 8 U S 2 ) for scattering from a square well of 
radius a with k^a = s/2mV 0 a 2 /h 2 - 6.2. 



Figure 13.8. Momentum dependence of the partial cross sections (cr 0 and a^) for S and 
P waves corresponding to the phase shifts of Figure 13.7. The cross sections are given in 
units of 7m 2 . 
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vhere 


E 0 = 




ience, we arrive at the very simple approximate relation 


tan(S # - &) = 


r 

2(E 0 - E) 


(13.88) 


(13.89) 


Since j8 f is a decreasing function of the energy, b must be negative. By its definition, 
i e is positive, and it thus follows that the quantity T defined in (13.88) is positive. 
The expressions (13.87) and (13.89) are useful if E 0 and T are reasonably constant 
ind if the linear approximation (13.86) is accurate over an energy range large com- 
pared with T. Under these circumstances, it can be seen from (13.87) that the phase 

r r 

>(S € - if) changes by 2tt as E varies from E « E 0 - — to E » E 0 + — . Hence, 

f ^ is also nearly constant in this interval, the phase shift changes by 7 r, and the 
partial cross section which is proportional to sin 2 8 t , changes abruptly. Such 
>udden variations in the phase shifts are resonances, with E 0 being the resonant 
energy and T the width of the resonance. 

If the phase shift S e is near resonance, the contribution of the corresponding 
partial wave to the scattering amplitude can be written according to (13.81), (13.86) 
ind (13.88) as 

2/ + i / r/2 \ 

ft = — £ — e2i \ E e r/2 + e ~ i€e sin ^ ) P ^ C0S 6) ( 13 - 9 °) 

giving a neat separation of the resonant part of the partial wave amplitude from the 
nonresonant part, which depends only on i t . If as is the case for low energies or 
high angular momenta, the hard sphere phase shifts are negligible, (13.89) reduces 
to 


tan 8f = 


r/2 

E 0 - E 


The resonant term in (13.90) predominates then and contributes to the total cross 
section an amount 


A-TT 

— —y (2€ + 1) sin 2 


8 e = 


4tt(2€ + 1) 

e 


4 (E - E 0 f + r 2 


(13.91) 


For a small width T, this represents a sharp maximum centered at E 0 with a sym- 
metric shape similar to that of the transmission resonance peaks in Figure 6.8. 4 * * 
Profiles with this energy dependence are called Breit-Wigner cross sections. They 
are experimentally resolvable if the width T, while narrow, is still wide enough so 
that the particles in the beam may be represented by spatially broad wave packets 
with A£ « T. 

Since we have discussed phase shifts and resonances in this section entirely in 
terms of the logarithmic derivatives at the boundary of an interior region within 


4 The P - wave resonance in Figure 13.8 is not accurately described by the simple formula (13.91) 

because the conditions under which this formula was derived (£ f <sc 1 , Ai, and reasonably constant) 

are not well satisfied. 
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which the scattering forces are concentrated, our conclusions are independent of the 
particular mechanism that operates inside this region. The interior region is a “black 
box,” characterized only by the values of the logarithmic derivatives at the surface. 
In particular, there is no need for the phase shifts to result from an interaction with 
a simple, fixed potential V. Rather, the scattering may arise from much more com- 
plicated interactions within the black box, and resonances may occur as a result of 
constructive interference at certain discrete frequencies, if some mechanism impedes 
the escape of the wave from the box. In Sections 6.4 and 7.4, where transmission 
resonances were considered, we saw how a precipitate change in the potential or a 
high potential barrier can produce narrow resonances, but in nuclear and particle 
physics resonances come about through a more complicated mechanism. For ex- 
ample, when a slow neutron enters a complex nucleus, it interacts in a complicated 
way with all the nucleons and its re-emission may thereby be considerably delayed. 
Owing to the wave properties of matter, such metastable states can occur only at 
certain discrete energies, and these are resonant energies. At low energies the neu- 
tron scattering cross section thus exhibits sharp maxima that may be regarded as the 
remnants of the discrete level structure which exists for E < 0. 

The reciprocal width of a resonance 1/r is a direct measure of the stability of 
the resonant state or of the delay between the time of absorption of the particle by 
the black box and its re-emission. To see this, we now consider the extreme case of 
a very narrow resonance centered at energy E 0 and a sharply pulsed, well-collimated 
wave packet involving a broad energy range, A E, so that A E » T. The analysis of 
Section 13.2, which assumed a slow energy variation of the scattering amplitude is 
no longer applicable. To simplify the discussion, without affecting any qualitative 
conclusions, we now assume that we have a very narrow isolated €-wave resonance 
and that all nonresonant contributions to the scattering are negligible. The scattering 
amplitude is taken to be 


m = 


_ 21 + 1 


172 


(13.92) 


The wave packet is represented by a momentum wave function 4> (k) that is appre- 
ciably different from zero only for k in the direction of incidence but that has an 
energy spread A E » T. Using 

k • r 0 ~ — kr 0 and k • v 0 ** ku 0 


we find that the scattered wave in Eq. (13.19) is, asymptotically, proportional to the 
expression 




scatt 


OC 


"C 

.0 


OS e ik(r+r 0 -v 0 t) 


me dk 


OC 


t: 


9 i(E~ E 0 )(r+ r 0 - v ( /)/iiv 0 


172 


E — E 0 + i 172 


dE 

(13.93) 


As in (7.68)-(7.69) and (A. 22) in the Appendix, we evaluate the integral and obtain 


*Ascatt I 


r 7 ri exp 
0 


r 

2 h 


r + r 0 


v 0 t>r+ r 0 
v 0 t < r + r 0 


(13.94) 


Thus, once the scattered wave packet has reached a sphere of radius r, the probability 
of finding the particle at that location decreases exponentially with time. (See the 
analogous one-dimensional illustration in Figure 7.8). A narrow resonance corre- 



08 


Chapter 13 Scattering 


ponds to a situation in which the incident particle spends a long time, of the order 
f the mean lifetime, r = h/T, in the interaction region before being scattered. 

It is possible to think of the quantity E 0 — iT! 2, which appears in the denom- 
lator of the scattering amplitude as a complex energy of the resonant state. If the 
ariables E and k are analytically continued into the complex plane, Eq. (13.87) 
hows that E 0 — iT/2 is a simple pole of S e (k) — e 2,S((k) and that the state with this 
omplex E value has no incoming wave. 

Exercise 13.17. Using Figure 13.8, estimate the mean lifetime of the meta- 
table state responsible for the P- wave resonance in units of the period associated 
/ith the motion of the particle in the square well. 


Exercise 13.18. Show that for a resonance the quantity h(d8 ( /dE), evaluated 
t E = E 0 is a measure of the lifetime of the metastable state. 


. Phase Shifts and Green’s Functions. Although the relation (13.76) between 
base shifts and logarithmic derivatives of the radial wave functions is a very general 
nd simple one, it does not shed any direct light on the dependence of the 8 e on the 
cattering potential. This connection can be elucidated if a partial wave analysis is 
pplied to the integral equation (13.39). To this end, the outgoing Green’s functions 
aust first be expanded in terms of Legendre polynomials. 

Since G+(r, r') is a solution of the equations 

(V 2 + f)G + = 0 and (V' 2 + k 2 )G + = 0 

f r # r', it is Seen from the separation of these equations in terms of spherical 
oordinates that the partial wave expansion for r > r' must be of the form 

G + (r, r') = ex P|^l r r I) = j? qi (k)Pf r • r')j e (kr')hf\kr) ( r > r') (13.95) 
|r — r | £=o 


vhere the particular choice of the spherical cylinder functions (Bessel and Hankel 
unctions of the first kind) is dictated by the regular behavior of G+ at r' = 0 and 
ts asymptotic behavior, G+ —*■ e lkr !r as r The remaining unknown coefficients, 
l t (k), in the expansion (13.95) can be determined by letting r «=* r' «= 0 (but still 
■ > r'). By using the first approximations (12.17) and (12.28) for Bessel and Hankel 
unctions, (13.95) simplifies to 


r 



f q t (k) r' e 
e=o (2€ + 1 )ik r e+1 


Pe(r ■ n 


fhis has to be compared with the expansion, familiar in electrostatics, 



.'t 


= 2 ~ETi p f$ • f ') 


t = o r 


vhich comes from the generating function (11.68) for Legendre polynomials. Com- 
parison of the last two equations yields the values of q e (k) and the desired identity: 


G+(r, r') = ^ 7T ^ = ik S (2€ + 1 )P e (t • f %(kr')h?\kr) (r > r') 
r - r €=o 


(13.96) 
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We now substitute (13.96) and the partial wave expansion (13.64) of i/4 +) (r, 0) into 
the integral equation (13.39), using (13.67), and carry out the integration over the 
direction of r'. For r > a we obtain the radial integral equation 

e iS < = Mkr) ~ ike iS < [ ifkr'yP{kr)u^{r')U{r'y dr' (13.97) 
r Jo 

Letting r — > °° and replacing u (ik (r) and the cylinder functions of argument kr by 
their asymptotic expansions, we finally arrive at the simple formula 

(13.98).. 

This is an explicit expression for the phase shifts in terms of the potential and the 
radial eigenfunctions. 

Exercise 13.19. Show that for all values of r the radial wave function u e yr) 
satisfies the integral equation 

u e , k (r) = r cos 8 e jfkr) + kr f jfkr')n e (kr)u (tk (r')U(r')r' dr’ 

Jo 

r (13.99) 

+ kr jfkr)nfkr')u t ' k (r')Uyy dr' 

Exercise 13.20. Verify (13.98) by applying a partial wave analysis directly to 
the scattering amplitude (13.43) provided that V(r) is a central potential. 

Some useful estimates of phase shifts may be based on (13.98) and (13.99). For 
instance, if the potential is not strong enough to produce a resonance, these coupled 
equations may be solved by successive approximation in an iterative procedure. The 
zeroth approximation to the wave function is 

u?l(r) = r cos 8 e j ( (kr) 

When this wave function is substituted in (13.98), we get the approximate phase 
shift 

r oo 

tan 8 ( « —k [ j e (kr')] 2 U(r')r' 2 dr' (13.100) 

Higher approximations may be obtained by iteration, but this is usually cumbersome. 
For values of € > ka, the spherical Bessel functions in the integral (13.100) may be 
approximated by the first term in their power series expansion [see (12.17)]. We 
thus obtain 

o 2 e (f \) 2 r» 

tan S t - ~ m + [ )!]2 k 2<+l J o U(r')r ,2e+2 dr' (13.101) 

as an estimate for the phase shifts. From this formula we deduce the rule of thumb 
that, for low energies and high angular momenta, 8 e k 2e+l . 

Exercise 13.21. Show that a partial wave analysis of the scattering ampli- 
tude in the first Born approximation (13.47) gives the same estimate as (13.100) if 
8, e « 1 . 
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. Scattering in a Coulomb Field. Since a Coulomb field, V = C/r, has an infinite 
ange, many of the results that we have derived in this chapter under the assumption 
f a potential with finite range are not immediately applicable without a separate 
ivestigation. For example, the concept of a total scattering cross section is mean- 
igless for such a potential, because every incident particle, no matter how large its 
mpact parameter, is scattered; hence, the total scattering cross section is infinite. 

Exercise 13.22. Show that the total cross section obtained from the differential 
ross section (13.9) diverges. (See also Exercise 13.8.) 


Owing to its great importance in atomic and nuclear physics, a vast amount of 
/ork has been done on the continuum ( E > 0) eigenstates of the Coulomb potential, 
n this section, a particular method of solving the problem will be presented because 
f its intrinsic interest. This method depends on the observation that the Schrodinger 
quation with a Coulomb potential is separable in parabolic coordinates, which are 
iefined by the equations 

£ — r + z r) = r — z <p = arctan - (13.102) 

x 


"he Schrodinger equation for the Coulomb potential. 


V(r) = 


r 


(13.103) 


[escribing the electrostatic interaction between two charges q x and q 2 , becomes in 
hese coordinate? [see Appendix, Eq. (A. 58)]; 


— [ 4 d (cl.) 

2m U + r, d£ y d£j 


+ 


£+ p dr) 



1 d 2 " 

\ dv) 

+ dip 2 


* + 


£ + v 


- Eijj 
(13.104) 


rnr scattering, we are interested in a solution with axial symmetry about the direc- 
ion of incidence, which we again choose to be the z axis. Hence, we must look for 
olutions that are independent of ip and that have the form 

v) = fi(Of 2 (y) (13.105) 

Separation of the parabolic variables £ and 17 is achieved easily, and the Schrodinger 
squation is replaced by two ordinary differential equations 

+ T “ cj ' = 0> + i vf2 - cd2 = 0 (13106) 

vhere h 2 k 2 = 2 mE, and the constants of separation, c t and c 2 , are related by the 
jondition 


Ci + c 2 


qi<hm 

h 2 


(13.107) 


Scattering is described by a wave function that asymptotically has an incident plane 
vave part, 

exp (ikz) = exp[ik (£ — rj)!2] (13.108) 

ind a radially outgoing part, proportional to 

exp (ikr) = exp[/k (£ + 17)/ 2] 


(13.109) 
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The behavior of these two portions of the wave function suggests that we look for 
a particular solution with 

/i(f) = e ik&2 (13.110) 

Substituting this into (13.106), we see that the Ansatz (13.110) is indeed a solution 
if we choose 


Ci = i 


The remaining equation for f 2 , 

df 2 


dr) dr) 


k 2 , 


( q\q 2 m . k 

\ * 


~ ‘ir 2 = 0 


is conveniently transformed if we define a new function g(r /): 

fi( V) = exp (-ikr)/ 2) g(rj) 


(13.111) 


(13.112) 


For a stationary bound state, we know from Chapter 12 that for an attractive 
Coulomb potential, 


E„ = 


2 h 2 n 2 


= ~(T) 


where in the last equality the virial theorem (Exercise 3.26) has been used. If the 
mean kinetic energy is expressed as mv 2 /2, defining an effective velocity v, we 
obtain the relation 


n — 


hv 


(13.113) 


linking the principal quantum number n with the effective orbital speed v. If we 
apply the same defining relation (13.113) to the positive energy unbound scattering 
states in a Coulomb potential, a generally noninteger, positive or negative, quantum 
number n is defined. This allows us to cast the differential equation for g(r)) in the 
form 


d 2 g dg 

rj —r + (1 - ikr)) — — f nkg — 0 (13.114) 

dr) dr) 

which should be compared with Eq. (12.89) for the confluent hypergeometric func- 
tion. We see that the solution of (13.114), which is regular at the origin, is 

g(r)) = Min’, 1; ikr)) (13.115) 


For purely imaginary argument z, the asymptotic expansion of this function has the 
leading term 5 


Ma\ c; z ) 


T(c) 

F(c — a) 


z\ a exp 


(“ 2) + r ( a) |z| ” v exp 

i(a — 

c ^ 

to| ^ 

1 1 



(13 


. 116 ) 


5 Morse and Feshbach (1953), Eq. (5.3.51). 
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applying this approximation to g(rj), we obtain for large values of 17, 


g(v) = 


exp (— «7 t/2) 
|r(l + in) 


n 


exp [—i(n In kp — cr n )\ + 7— exp[i(kri + n In krj — 

kp 


.)]} 


(13.117) 

vhere T(1 + in) = |T(1 + in) \e ,crn . 

Collecting all these results together, we conclude that the particular solution 


«K£ 17, (p) = exp [ik(£ - 'q)/2] l F 1 (in\ 1; ikrj) 
)f the Schrodinger equation has the asymptotic form 
exp (—mr/2) 


(13.118) 




|T(1 + in) | 


exp{ i[kz — n In k{r — z) + a n ]} 


(13.119) 


+ 


n 


k(r - z) 


exp{i[£r + n In k(r — z) — &„]} 


'or large values of r — z = r(l — cos 9), where 9 is the scattering angle. Equation 
13.119) is valid at large distance from the scattering center, except in the forward 
Erection. It shows clearly why it is impossible to obtain for a Coulomb potential 
m asymptotic eigenfunction that has the simple form, 


</r = N 


„ikr 


e lkz + fM — 

r 


deduced for a potential of finite range. The Coulomb force is effective even at very 
?reat distances and prevents the incident wave from ever approaching a plane wave. 
Similarly, the scattered wave fails to approach the simple free particle form, e lkr lr, 
is r — > 00 . However, it is important to note that the modification, for which the 
Coulomb potential is responsible at large distances from the scattering center, affects 
enly the phase of the incident and scattered waves. Since the asymptotic contribu- 
tions from the Coulomb potential to the phases are logarithmic functions of r, and 
lienee vary but slowly, the analysis of the fate of an incident wave packet in a 
scattering process can still be carried through, and the result is precisely the same 
as before. The differential cross section is [see (13.23)] 


% " I *®' 2 


where the scattering amplitude f k (9) is again the factor multiplying the radial part 
of the asymptotically outgoing wave, provided the incident (modified) plane wave 
is normalized so that the probability density is unity. Hence, for all finite values of 
the scattering angle 9, 


fk(.0) = TZ — ™ — exp[m ln(l - cos 0)] 

k{ 1 — cos 9) 


(13.120) 


and the differential scattering cross section is 


dcr _ n 2 _ q 2 ql 1 

dTl ~ 4 If sin 4 (9/2) ~ USE 2 sin 4 (9/2) 


(13.121) 
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in exact agreement with the classical Rutherford scattering cross section (13.9) and 
the Born approximation. An angle-independent addition to the phase has been omit- 
ted in (13.120). 

Since i Fi(a\ c; 0) = 1, the normalization of the eigenfunction (13.118) is such 
that if/ — 1 at the scattering center (origin). On the other hand, the incident wave in 
the large brace of (13.119) is normalized to probability density unity far from the 
scattering center, and 


i < K °°)| 2 


exp(— nir) 
f(l + in) \ 2 


Hence, the penetration probability of finding the particle at the origin relative to the 
probability of finding it in the incident beam is 


\m \ 2 

I'M 00 )! 2 


= |r(l + in) IV 17 = 


lirn 

1 - <T 2 ™ 


(13.122) 


where in obtaining the last expression use was made of (8.90) and the definition of 
the T function: T(1 + z) = zF(z). 

If the Coulomb potential is repulsive ( q x q 2 > 0 and n < 0) and strong compared 
with the kinetic energy of the particle, we have as a measure of penetration to the 
origin, 

2 

|2 = -27 me 2 ™ (i n « 0) (13.123) 


Iwl 

l<A(°°) 


The significance of the penetrability, 


e 2jrn 



2TTq 1 q 2 \ 
hv ) 




(13.124) 


was already discussed in connection with Eq. (7.52) for a one-dimensional WKB 
model of Coulomb scattering. The exponent G is called the Gamow factor in nuclear 
physics. 


Exercise 13.23. Discuss other limiting cases (fast particles, attractive poten- 
tials) for (13.122). 

Exercise 13.24. Calculate the wave function (13.1 18) in the forward direction. 
What physical conclusions can you draw from its form? 


Since the Schrodinger equation is separable in parabolic coordinates only if the 
potential behaves strictly as 1/r at all distances, the method of this section is not 
appropriate if the potential is Coulombic only at large distances but, as in the case 
of nuclear interactions with charged hadrons, has a different radial dependence near 
the origin. It is then preferable to use spherical polar coordinates and attempt a phase 
shift analysis. By expanding the eigenfunction (13.118) in terms of Legendre poly- 
nomials, phase shifts can be calculated for the Coulomb potential, and the theory of 
Section 13.5 can be extended to include the presence of a long-range 1/r potential. 



514 


Chapter 13 Scattering 


°roblems 

L. Using the first three partial waves, compute and display on a polar graph the differ- 
ential cross section for an impenetrable hard sphere when the de Broglie wavelength 
of the incident particle equals the circumference of the sphere. Evaluate the total 
cross section and estimate the accuracy of the result. Also discuss what happens if 
the wavelength becomes very large compared with the size of the sphere. 

1. If the scattering potential has the translation invariance property V(r + R) = V(r), 
where R is a constant vector, show that in the first Born approximation, as in the 
exact formulation (Exercise 13.7), scattering occurs only when the momentum trans- 
fer q (in units of h) equals a reciprocal lattice vector G. 



CHAPTER 14 


The Principles of Quantum Dynamics 


If the Hajniltonian operator is known, an initial wave function if/(r, 0) 
develops in time into if/(r, t ) according to the time-dependent Schrodinger 
equation (Chapters 2 and 3). This algorithm for calculating the future 
behavior of a wave packet from its past history was used in Chapter 13 
for scattering calculations. We now extend the general principles of 
quantum mechanics (Chapters 9 and 10) to the laws governing the time 
evolution of quantum systems, utilizing several equivalent dynamical 
pictures (Schrodinger’ s, Heisenberg’s, Dirac’s). The canonical 
quantization of systems with classical analogues is discussed and applied 
to the forced harmonic oscillator, which is the prototype of a system in 
interaction with its environment. 


1 . The Evolution of Probability Amplitudes and the Time Development Operator . 
We now add the time parameter to the description of quantum states and generalize 
the fundamental postulate of quantum mechanics (Section 9.1) by asserting that: 


The maximum information about the outcome of physical measurements on a 
system at time t is contained in the probability amplitudes (i£ ; |' v I r (t)), which 
correspond to a complete set of observables Kfor the system. 

The only new feature here is that we now recognize formally that the state ^ is a 
function of time. 

For the simple system of a particle with coordinates x, y, z as observables, the 
amplitude is 

<A(r, 0 = <*, y, z|^(0) (14.1) 

which is the time-dependent wave function in the coordinate representation. The 
same state is represented in the momentum representation as 

0(P> 0 = (Px,Py,Pz |¥(0> (14.2) 

The basic question of quantum dynamics is this: Given an initial state |‘^ r (f 0 )) 
of the system, how is the state |^(t)) at time t determined from this, if indeed it is 
so determined? Or, in terms of the amplitudes that specify the state, how do the 
amplitudes (L / |''F(t)) evolve in time from the initial amplitudes (^T j |^ r (/ 0 ))? 

The assertion that |'^ r (f 0 )) determines |^(0) is the quantum mechanical form of 
the principle of causality, and we shall assume it. The dynamical law that connects 
the initial and final amplitudes is contained in the further assumption that the com- 
position rule (9.8) can be generalized to the time-dependent form 


0,m0> = I (Lj\T(t, tofKfiK.lnto)) 

i 


(14.3) 


where the coefficients 
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ire independent of the state |' v f r (t 0 )). They have a direct and simple interpretation: 
The expression (14.4) signifies the probability amplitude for finding the system at 
time t in the eigenstate of | Lj) of the observables symbolized by L, if at time t 0 it 
was known to be in the eigenstate |AT f ) of the observables K. This quantity is called 
a transition amplitude. With forethought it has been written in the form of a matrix 
element of an operator T(t, t 0 ), because from (14.3), which is valid for any state 
¥(f 0 )>, we can derive the transformation equation 

(Lj\T(t, r 0 )|iO = 2 2 (Lj\N p )(N p \T(t, t 0 ) \ M e )(M t \ K t ) (14.5) 

P * 

which shows that (14.4) defines a representation-independent linear operator T{t, t 0 ). 
Equation (14.3) is consistent with the composition rule (9.8) if we require that 

(Lj | 7ft, t)\Ki) = ( Lj\Ki ) for all times t 


or 


T(t, t) = 1 


(14.6) 


Exercise 14.1. Prove the relation (14.5) from (14.3) without assuming that 
(14.4) is the matrix element of an operator. 


It now follows from the composition rule (14.3) that the time development or 
evolution operator T(t, t 0 ) relates the initial state ^(to)) to the final state iSPft)) 
according to 


\nt)) = T(t, t 0 )llM> 


(14.7) 


Since T(t, t 0 ) does not depend on [^(to)), the principle of superposition applies 
to the time development of states. This means that if |''P a (t 0 )) and [^(to)) separately 
evolve into 1^(0) and |^(0). then a superposition c a [ ''P fl (t 0 )) + | \P),(?o)) devel- 

ops into c a \ty a (t)) + c b \ty b (t)), i.e., each component of the state moves indepen- 
dently of all the others, expressing the fundamental linearity of quantum dynamics. 

From (14.7) it follows that 

l^fe)) = T(t 2 , ti)l^(ti)) = T(t 2 , tf)T(t\, t 0 )\V(t 0 )) = T(t 2 , to)l 1 J f (^o)) 


Hence, the time development operator has the property 

T(t 2 , t 0 ) — T(t 2 , tf)T(t\, t 0 ) 


(14.8) 


From (14.6) and (14.8) we infer that 

T(t, t 0 )T(t 0 , t ) = T(t 0 , t)T(t, t 0 ) = 1 


or 


[T(t, t 0 )3 _1 = T(t 0 , t) 


(14.9) 


For small e we may write 

T(t + e, t) = 1 — -r sH(t) 
n 


(14.10) 
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defining an operator Hit). (The reason for introducing the factor it ft, apparently 
capriciously, will become evident forthwith.) Since, by (14.8), 

T(t + e, t 0 ) = T{t + e, t)T(t, t 0 ) 

we have with (14.10) the differential equation for T, 

dT(t, t Q ) T{t + s, t 0 ) ~ T{t, to) i . , 

= hm — — = - r H(t)T(t, to) 

dt e ->o e n 


or 


f o) . \ 

in — - — = H(t)T(t, t 0 ) 
at 


(14.11) 


with the initial condition T(t 0 , t 0 ) = 1 . 

The linear operator H(t) is characteristic of the physical system under consid- 
eration. We will see that it is analogous to the Hamiltonian function in classical 
mechanics. This analogy has led to the name Hamiltonian operator for H(t), even 
when the system has no classical counterpart. 

We also have 


I nt + e)) = Tit + e, t) I TO) 


or, to first order in e, 

I ¥(0> + sj { |¥(f)> = 



I ¥(0> 


Hence, 


Its bra form is 


ih ~ |¥(f)> - Hit)\nt)) 

dt 


dt 


(14.12) 


(14.13) 


Equation (14.12) is the equation of motion for the state vector, giving the general 
law of motion for any Hamiltonian system. 

To specialize to a particular system, we must select an appropriate Hamiltonian 
operator. The form of (14.12) is reminiscent of the time-dependent Schrodinger 
equation (3.42). This is no accident, for we have merely reformulated in abstract 
language those fundamental assumptions that have already proved their worth in 
wave mechanics. Of course, (14.12) is an equation for the state vector rather than 
for the wave function (14.1), but the distinction is one of generality only. It now 
becomes clear why Planck’s constant was introduced in (14.10). In Chapter 15, we 
will see how the laws of wave mechanics derive from the general theory. 

In wave mechanics, the differential operator H was generally Hermitian, as it 
must be whenever the Hamiltonian corresponds to the energy operator of the system. 
Generally, we will assume that H is Hermitian. A non-Hermitian Hamiltonian op- 
erator, like a complex index of refraction in optics, can be useful for describing the 
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dynamics of dissipative systems which, through absorption or decay, exchange en- 
ergy with their environment. 

If H(t ) is Hermitian, the equation adjoint to (14.11) becomes 

-in to) = T(U tf)H{t) (14.14) 

at 

and (14.13) becomes 

-ih^mt) I = (^(0 1 Hit) (14.15) 

at 

By multiplying (14.11) on the left by T*(t, t 0 ) and (14.14) on the right by T(t, t 0 ) 
and subtracting the two equations, we get 

“ [T\t, t 0 )T(t , t 0 )] = 0 
at 

Since, by (14.6), the product of the two operators equals the identity at t — t 0 , it 
follows that the time development operator is unitary: 

T\t, t 0 )T(t, t 0 ) = 1 (14.16) 

Hence, the norm of any state vector remains unaltered during the motion. If |^'(t 0 )) 
is normalized to unity, such that (M r (r 0 )|''I , '(to)) = 1, then the normalization will be 
preserved in the course of time, and we have from (14.7) and (14.16) that 

(q/( ? ) | ijr^)) — i f or a q times t 

consistent with the assumption that |(L,|'4 r (0)| 2 is the probability of finding the 
observable L to have the value Lj at time t. 

Often H does not depend on the time, and then T can be obtained for finite time 
intervals by applying the rule (14.8) repeatedly to n intervals, each of length 
e = (t — t 0 )/n. Hence, by (14.10) we have, with the initial condition T(t 0 , t 0 ) = 1, 

T(t, t 0 ) = lim ( 1 - j sH 
e — >0 \ n 


— lim 


1 1 (t ~ t(i) V 

1 y * 

n n 


1 " 


In the limit we get by the definition of the exponential function, 


T{t, to) = exp 




(14.17) 


It is obvious that T is unitary if H is Hermitian. 

Quantum dynamics is a general framework and contains no unambiguous pre- 
scription for the construction of the operator H whose existence it asserts. The Ham- 
iltonian operator must be found on the basis of experience, using the clues provided 
by the classical description, if one is available. Physical insight is required to make 
a judicious choice of operators to be used in the description of the system (such as 
coordinates, momenta, and spin variables) and to construct the Hamiltonian in terms 
of these variables. 

Contact with measurable quantities and classical concepts can be established if 
we calculate the time derivative of the expectation value of an operator A, which 
may itself vary with time: 


1 'X A 

ih ~ <^(0|A|^(0) - -OPri)ltfAl'P(O) + (^(0|A/f|^(t)) + ihQSnt) \ — | <P(f)> 

at dt 
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or 



(14.18) 


where, as usual, the brackets (...) signify expectation values of the operators 
enclosed. We see that the commutation relations of H with observables play an 
important role in the theory. If A is independent of time and commutes with H, the 
expectation value of A is constant, and A is said to be a constant of the motion. 

A special example of a time-dependent operator is the density operator for the 
state |^(0), 

p = |¥(r»W0| (14.19) 

From the equations of motion (14.12) and (14.15), 

ih ^ - |¥(0>Wf)|ff = [H, p ] (14.20) 

at 

Hence, (14.18) gives the simple result 


j f (p) = 0 (14.21) 

which is not surprising in view of the conservation of probability. The definition 
(14.19) implies that 

<[h, p]> = moi w, mo>moi] mo> = o (14.22) 


which, by (14.20), leads to the conclusion: 



(14.23) 


These results have sometimes led to the oxymoronic proposition that the density 
operator is a “time-dependent constant of the motion”! 

The formal relations derived in this section are all rooted in what we have called 
the quantum mechanical principle of causality, which states that the probability 
amplitude for finding the value of K t of the generic observables K at time t can be 
written as the inner product (^f ! | 1 f r (0)- The observables K and their eigenvectors are 
regarded as constant. Physically, this implies that a system represented by the same 
state vector at two different times has the same statistical distribution at the two 
times with regard to all observables of the system. In other words, | NP(f)> completely 
characterizes the state of a system at time t, which was the fundamental assumption 
made in Chapter 9. We will now discuss other, equivalent, formulations of quantum 
dynamics. 


2. The Pictures of Quantum Dynamics. The mathematical formulation of quan- 
tum dynamics given in the last section is not unique. There we showed that the state 
vector of a system alone may be made responsible for the time evolution, but this 
is not the only way of dealing with dynamics in the framework of vector spaces. 
State vectors themselves are not observable or measurable quantities. Rather, the 
eigenvalues of Hermitian operators and probability amplitudes such as (14.3) and 
(14.4) are the physically significant objects of quantum mechanics. Comparison with 
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observation is made in terms of the eigenvalues of observables and of expansion 
coefficients (probability amplitudes), which are inner products in the abstract state 
vector space. Measuring an observable L at time t means finding one of its eigen- 
values Lj, the probability of the particular result being given by \{Lj | ^(0) ] 2 if | M*(0) 
denotes the state of the system at time t. 

It follows that two vector space formulations, or pictures, are equivalent and 
equally acceptable provided that 

(a) the operators corresponding to the observables maintain their eigenvalue 
spectra in the two formulations, and 

(b) the inner products of physical states with eigenvectors for the same eigen- 
values are equal in the two pictures. 

It follows from (a) and (b) that all expectation values remain the same from one 
picture to the other. 

Starting from the Schrodinger picture of quantum dynamics, a new picture is 
obtained, satisfying conditions (a) and (b), by applying a time-dependent unitary 
transformation U{t) to all states and operators. All the state vectors change from 
1^(0) to t/COl^CO)* and every operator A is transformed into U(t)AU f (t). Owing 
to the unitary property of U(t), all eigenvalues and all inner products remain invar- 
iant, but the eigenvectors of an observable change from j A ') to U(t) \ A '}. Expectation 
values remain unchanged. 

The simplest choice for the transformation U(t) is 

U(f) = 7T(0, t) = T\t, 0) (14.24) 

Since 

f/(t)|^(0> = 7X0, t)\V(t)) = |¥(0)> (14.25) 

this transformation has the effect of referring all state vectors back to their values 
at the initial time 0. In this formulation, called the Heisenberg picture, the state 
vectors are constant in time, and we denote them as 

|^(0)) = fW\ (14.26) 

The observables, which in the Schrodinger picture were represented by operators 

fixed in time (unless they happened to have an explicit time dependence), are rep- 
resented in the Heisenberg picture by time- dependent operators, 

Iff) = U(t)LU\t) = T\t, 0)LT(t, 0) (14.27) 

At the initial time t = 0, 

1(0) = L (14.28) 

These equations exhibit the evolution of the _Heisenberg operators. To identify the 
Heisenberg picture, the bar notation, as in |^) and L(t), will be used only in this 
chapter. Elsewhere in this book, when there is no ambiguity, we will simply use 
I^P) and L(t ) for Heisenberg states and operators. 

As a special case, note that if H is constant and energy is conserved, then 

Tit, 0) = exp^” Ht 



(14.29) 
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and Hit) = H. If H is constant, the Hamiltonian operator does not change in time, 
even in the Heisenberg picture. 

By differentiating (14.27) with respect to t and using the equation of motion 
(14.11) for the time development operator, we obtain the Heisenberg equation of 
motion for an observable: 


ih j Lit) = [Lf), Hit)] + ih ^ 
at at 

The last term arises from the definition 


8Ljt) 

dt 


T\t, 0) - Tit, 0) 

at 


(14.30) 


(14.31) 


in the event that L is explicitly time-dependent. To emphasize that dLit)/dt is the 
Heisenberg form of the derivative of the explicitly time-dependent operator L, 
strictly speaking we should write this operator as 


dL _ dLjt) 
dt dt 


(14.32) 


The expectation value of the operator equation (14.30) in the Heisenberg picture, 
where the states are constant in time, is the counterpart of Eq. (14.18) in the Schro- 
dinger picture. 

Eigenvectors of observables, corresponding to the same eigenvalues, differ in 
the Schrodinger and Heisenberg pictures by the unitary transformation Uit): 

\ v> = m t)\Lj) (14.33) 


or 

! Lj) = Tit, 0)|Z~7> 

Differentiating this last equation with respect to t, we obtain 


dTjt, 0) 
dt 


I Lj, t) + Tit, 0) — j Lj, t) = 0 


(14.34) 


or, using (14.1 1), 

ih 4 I Lj, t) = -H\L„ t) (14.35) 

dt 

which is very similar to the equation of motion (14.2) in the Schrodinger picture, 
except for the all-important minus sign. Its appearance shows that if in the Schro- 
dinger picture we regard the state vectors as “rotating” in a certain direction in 
abstract vector space and the operators with their eigenvectors as fixed, then in the 
Heisenberg picture the state vectors stand still and the operators with the eigenvec- 
tors “rotate” in the opposite direction. But the mutual relation between state vectors 
and operators is the same in the two pictures. They are related to each other in much 
the same way as the two kinematic descriptions of the rotation of a rigid body with 
respect to a reference frame. We can consider the body moving in a fixed frame, or 
the body as being at rest, with the reference frame rotating backward. Since the two 
pictures are connected by a unitary transformation, the probability amplitudes (inner 
products) are equal: 


<£,!^( 0 > = {Lj,t |¥> 


(14.36) 
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Exercise 14.2. Find the equation of motion for the Heisenberg bra (Lj, f|. 
Exercise 14.3. Show that in the Heisenberg picture the density operator for 


state I 1 ?), 



p = iTO))mo)i - m<v\ 


(14.37) 

satisfies the equations 



ih — = ihTft, 0) — T(t, 0) = [H, p\ and 

dt dt 

o 

II 

£1* 

(14.38) 

and that the expectation value of p is constant as in (14.21). 




Instead of attributing the time evolution to either the state vectors or the op- 
erators, as in the Schrodinger and Heisenberg pictures, it is obviously possible to 
partition the time development operator in infinitely many different ways. We can 
arrange to let both the state vectors and the observables carry a complementary share 
of the time development. If the system and its Hamiltonian are complicated, it is 
often sensible to choose for U(t ) the adjoint of the time development operator for a 
suitably chosen model Hamiltonian H 0 , which is the solution of 


ih = — U(t)H 0 
at 


(14.39) 


subject to the initial condition (7(0) = 1. Generally, H 0 may be time-dependent. 
The observables are then transformed into new operators 

L(t) = U(f)LU\t ) (14.40) 


and these satisfy the equation of motion 


ih ~ Lit ) = [Lit), H 0 ] + ih 
dt 


dL{t) 

dt 


(14.41) 


More important is the equation for the transformed state vector, 


| TO) = 1/(0 1 TO) (14.42) 

By differentiating this equation with respect to t, and using (14.12), we obtain 

ih ~ |TO) - TO | TO) + U(t)HUHt)\nt)) = U(t)(H - H 0 )U\t)\nt )) 
dt 

We define an interaction term V as the difference between the Hamiltonian H and 
the model Hamiltonian H 0 by 

H = H 0 + V (14.43) 

giving us the simple-looking formula 


ih j | TO) = F(0 1 TO) 

dt 


(14.44) 


where 

no = u(t)vu\t) 


(14.45) 
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is the transformed version of V. The resulting formulation of quantum dynamics is 
known as the interaction (or Dirac ) picture. Note that if the model Ha mi ltonian H 0 
is time-independent and thus conservative. 


U(t ) = exp( — H 0 t 


(14.46) 


and 

a 

H 0 (t) = Ho (14.47) 

If Hq is chosen to be the full Hamiltonian, H 0 — H, the interaction picture coalesces 
with the Heisenberg picture. If H 0 = 0, the Schrodinger picture is recovered. The 
interaction picture will be found useful when we consider time-dependent pertur- 
bation theory in Chapter 19. 

To demonstrate the total equivalence of the various pictures of quantum dy- 
namics, let us suppose that at time t 1 the system has the definite sharp value A ' for 
the observable A. We ask: “What is the probability that at time t 2 the system will 
have value B" if the observable B is measured?” The answer is that the required 
probability is the square of the absolute value of the amplitude 

(B", t 2 \f(t 2 , h)\A', f) 

where f(t 2 , t } ) is the time development operator for the state vector in the interaction 
(or really a generic) picture as defined by 

1^(0) = T(t, t 0 )\V(t 0 )) (14.48) 

From this it follows, with (14.42), that 

T(t 2 , h) = U(t 2 )T(t 2 , h)U\t 0 (14.49) 

Since we infer from (14.42) that 

| A', t ) = U(t)\A') and (B", t\ = {B"\tf(t) (14.50) 

we see that the transition amplitude can be expressed equivalently in all pictures as 

C B ", t 2 1 T(t 2 , h)\A' t f) = (B"\T(t 2 , h )\A') = {W^lWTh) = (BfhlAVh) 
Interaction Schrodinger Heisenberg picture 

(14.51) 


The distinctiveness of the Schrodinger and Heisenberg pictures is manifested by the 
important fact that the Hamiltonian H (energy) is the same in both pictures. 


Exercise 14.4. Show that the expression 

(B",t 2 \f(t 2 , 10^,0 

for the transition amplitude is quite general and gives the correct answer if the 
Schrodinger ( H 0 = 0) or Heisenberg ( H 0 = H) pictures are employed. 

3. Quantization Postulates for a Particle. Let us now apply the general equations 
of quantum dynamics to the special case of a point particle with a mass m. We are 
concerned with the quantum behavior of this system, but it does have a classical 
analogue — Newtonian mechanics, or its more sophisticated Lagrangian or Hamil- 
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tonian forms. The position operators x, y, z are assumed to form a complete set of 
commuting observables for this physical system. 

For the purposes of this chapter it is worthwhile to distinguish in the notation 
between the classical observables, x, y, z, which represent numbers, and the cor- 
responding quantum observables, x, y, z, which stand for operators. 

Setting A = x in (14.18), we obtain 


d{x) _ (xH — Hx) 
dt ih 


(14.52) 


On the left-hand side there is a velocity, but if we wish to compare this equation 
with the classical equation for velocities we cannot simply let the operators go over 
into their classical analogues, because classical observables commute and we would 
have zero on the right-hand side. Hence, we must let h — » 0 at the same time. Thus, 
we formulate a heuristic version of the correspondence principle as follows: 

If a quantum system has a classical analogue, expectation values of operators 
behave, in the limit h — > 0, like the corresponding classical quantities. 

This principle provides us with a test that the quantum theory of a system with 
a classical analogue must meet, but it does not give us an unambiguous prescription 
of how to construct the quantum form of any given classical theory. Certainly, we 
cannot expect that every valid classical equation can be turned into a correct quan- 
tum equation merely by replacing classical variables by expectation values of op- 
erators. For example, xp x = mx ( dxldt ) = (1/2 )m(dx 2 /dt) is a valid classical equation 
if not a particularly useful one; yet, for operators (xp) = (1/2 )m(d/dt)(x 2 ) is gen- 
erally wrong, although 

.... d(x) / 1 \ 1 d 2 . 

(x)(p x ) = m(x) — and < - (xp x + p x x) ) = - m — (x ) 


are both correct. The trouble comes from the noncommunitivity of x and p x . 

To make the conversion from classical to quantum mechanics, the correspon- 
dence principle must be supplemented by a set of quantization rules. These rules 
have to be consistent with the correspondence principle, but their ultimate test lies 
in a comparison between the theoretical predictions and the experimental data. 

We expect that (14.52) is the quantum analogue of one of Hamilton’s equations, 


dx _ dH 
dt dp x 


(14.53) 


where H is the classical Hamiltonian function of x, y, z, p x , p y , p z , which characterize 
the system. The correspondence principle requires that 


lim 


(xH - Hx) 
ih 


dH 

dp* 


Similarly, for A — p x , we have in quantum mechanics 

d{p x ) _ ( p x H ~ Hp x ) 


dt 


ih 


(14.54) 


(14.55) 


dp x 

dt 


and classically, 


dH 

dx 


(14.56) 
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The correspondence principle requires that 


lim 

o 


(pjj ~ Hp x ) 
ih 


aH 

dx 


(14.57) 


Similar equations follow for y and z and their conjugate momenta. 

All these conditions can be satisfied if we do the following: 

1 . Let H be »a Hermitian operator identical in form with H but replace all 
coordinates and momenta by their corresponding operators. 

2. Postulate the fundamental commutation relations between the Hermitian op- 
erators representing coordinates and momenta: 

[ x,p x ] = [y, p y ] = [z, p z ] = ihl (14.58) 

[x, p y ] = [x, p z ] = [y, p z ] = [y, p x ] - [z, p x ] = [z, p y ] = 0 (14.59) 

The coordinates, x, y, z, commute with each other; the three momenta p x , p y , 

p z , also commute. 

Prescription (1) must be applied with care if H contains terms such as xp x , 
because x and p x are noncommuting and would upon translation into operator lan- 
guage give rise to a non-Hermitian H. The symmetrized operator ( xp x + p x x)/2 can 
then be used instead. It is Hermitian and leads to the correct classical limit. Some- 
times there may be several different ways of symmetrizing a term to make it Her- 
mitian. Thus the Hermitian operators x 2 p\ + p 2 x x 2 and (xp x + p x x) 2 l 2 both have the 
classical limit 2x 2 p x , but they are not identical. In practice, it is usually possible to 
avoid such ambiguities. 


Exercise 14.5. Show that the operators x 2 p 2 + p 2 x 2 and (xp x + p x x) 2 / 2 differ 
only by terms of order h 2 . 

The consistency of conditions (1) and (2) and their agreement with (14.54) and 
(14.57) can be verified for any H that can be expanded in powers of the coordinates 
and momenta. For instance, the commutation relation [jc, p x ] = ihl agrees with 
(14.54) and (14.57), as can be seen if we choose H — p x and H = x, respectively. 
The consistency proof can be continued by letting H = x n . Then 

\ lPx , *"] = — - {p x , *1 + 7T [Px, = - nx n ~ l 

m m in 

by virtue of repeated invocation of the quantum conditions (14.58). This is in agree- 
ment with the classical limit (14.57) because for H = x", 

dx n . 

= —nx 1 

dx 

More generally, we can continue this type of reasoning to prove that for any 
two functions, F and G, of the coordinates and momenta, which can be expanded in 
a power series, the relation 

- FG ) 

hm 

a->o ih 


_ dG dF dF dG | 8G dF 

dx dp x dx ap x ay ap y (14.60) 

aF aG | aG aF aF aG 

dy ap y az ap z az ap z 
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holds, where F and G are the same functions of the ordinary variables as F and G 
are of the corresponding operators. Equation (14.60) is assumed to be valid for any 
smooth functions of coordinates and momenta, even if a power series expansion 
cannot be made. Equations (14.54) and (14.57) are special cases of (14.60). 

In classical mechanics, the expression on the right-hand side of (14.60) is ab- 
breviated as [G, F] P 5 . and is known as the Poisson bracket 1 ( P.B .) of F and G. Dirac 
discovered that this is the classical analogue of the commutator [ G , F]/ih. 


Exercise 14.6. Illustrate the validity of Eq. (14.60) by letting G = x 2 and 
F — pi, and evaluating both the operator expression on the left, in the limit h — » 0, 
and the corresponding Poisson bracket on the right. 


All of these arguments can be summarized in the proposition that the classical 
equation of motion for an arbitrary function A(x, p x , t ), 


dA BA 

— - [A, H]„. + — 


(14.61) 


is the correspondence limit of the quantum equation of motion (14.18), 


d(A) = / [A, H] \ + /dA\ 
dt \ ih J \ Bt J 


(14.62) 


for the operator A(x, p x , t ), which is the same function of its arguments as A(x, p x , t ). 


4 . Canonical Quantization and Constants of the Motion. So far we have con- 
sidered only descriptions of the physical system in terms of Cartesian coordinates 
for a point particle. Yet, the connection between classical and quantum mechanics 
was established by the use of Hamilton’s equations of classical mechanics, which 
are by no means restricted to Cartesian coordinates. Rathej, these equations are well 
known to have the same general form for a large class of variables called canonical 
coordinates and momenta, and denoted by the symbols q and p. Since in the Ham- 
iltonian form of mechanics the Cartesian coordinates do not occupy a unique posi- 
tion, we ask whether the quantization procedure of Section 14.3 could not equally 
well have been applied to more general canonical variables. Could we replace x by 
q and p x by p (assuming for convenience only one degree of freedom), satisfying 
the more general commutation relations 


qp pq = ihl 


(14.63) 


instead of (14.58), and could we still apply the same quantization rules? 

To show that we are indeed at liberty to use canonical variables other than the 
Cartesian ones, we must prove that the same form of quantum mechanics results 
whether we use x, p x , or q, p to make the transition to quantum mechanics. To prove 
that we can pass from the upper left to the lower right corner of Figure 14.1 equiv- 
alently by routes 1, 2 or 3, 4 we first consider an infinitesimal canonical transfor- 


1 Goldstein (1980), p. 397. 




4 Canonical Quantization and Constants of the Motion 


327 


canonical transformation 


x,p x 
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unitary transformation 


Figure 14.1. Classical canonical transformations and quantum mechanical unitary 
transformations. 


mation (step 1 in the figure), i.e., a transformation that is generated by an infinites- 
imal function eG(x, p x ) from the relations 


q 


X + 8 


dG 

dp x ’ 



(14.64) 


The new Hamiltonian is 


H'(q, p) = H(x, p x ) = H(q, p) - 


dU aG 

ax dp x 


aH aG 
ap x ax 


= H(q, p) 4- e[G, H] P jB 


(14.65) 


This canonical transformation is paralleled in quantum theory by step 4. Agreement 
with (14.64) and (14.65) in the correspondence limit is assured if we define the 
Hermitian operators 


q = x + \x, G], p = p x + [p x , G ] (14.66) 

in in 

More generally, for an arbitrary function F(x, p) we find to first order, 

aF aG aF aG 

F(q, p) = F(x, p x ) + £ — £ T _ T _ = F ( x ’ Px) + e t F » (14.67) 

dx ap x ap x ax 

The corresponding operators satisfy the equation 

F(q, p ) = Fix, p x ) + [F, G] (14.68) 

in 

again to first order in e. In quantum mechanics, the new Hamiltonian is constructed 
by correspondence with (14.65) as 

H'{q, p) = H(x, p x . ) = H(q, p) + ^ [G, H] (14.69) 

The Hermitian operator G(x, p x ) is constructed from G(x, p x ) by letting x and p x 
become operators. The commutators are evaluated by applying the quantization rules 
of the last section for Cartesian coordinates. 
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To first order in e, (14.68) may be written as 


is 


F{q,p) = I 1 + ~G\F(x,p x )\l - 


le 

h 


(14.70) 


showing that the new operators are obtained from the old Cartesian ones by an 
infinitesimal unitary transformation (step 4): 

F(q,p) = U e F(x, Px )Ul (14.71) 

with 


is 

u - = I + n G 


(14.72) 


The Hermitian operator G is the generator of this infinitesimal transformation. 

In terms of the new variables, the quantum analogue of the classical Hamilto- 
nian, (14.69), becomes 

H'(q, p) = H(x, p x ) = H(UtqU s , U\qU e ) = U\H{q, p)U e (14.73) 

[See Exercise 9.16 for the conditions under which the last equality in (14.73) holds.] 
The commutation relations are invariant under unitary transformations because 

[q,p\ = U £ xUlU E p x Hl - U E p x UlU e xUt = U e [x, p x ]Ut = ihl (14.74) 

and we have arrived at (14.63). This completes the proof that the quantization rules 
of 14.3 can be extended to new canonical variables that are infinitesimally close to 
Cartesian. 

The quantization procedure based on rules (1) and (2) of Section 14.3 can now 
be immediately generalized to all those canonical variables that can be obtained 
from the Cartesian ones by a succession of infinitesimal canonical transformations. 
This is true because two classical canonical transformations made in succession can 
be replaced by a single direct one. Similarly, in quantum mechanical transforma- 
tions, successive application of unitary operators is equivalent to the application of 
a single unitary operator. If we let e = AIN (where A is a finite parameter and N is 
an integer), and apply the same unitary operator N times, we obtain the in limit 
N — » oo the unitary operator, 

+ = exp (H (14J5) 

This finite unitary transformation changes the Cartesian variables into 

q = UxU\ p = Up x U f • (14.76) 

The commutation relations are also invariant under the finite transformations. 

We note that if (14.76) holds, the eigenvalue spectra of x and q are the same, 
as 

UxU f U\x') = qU\x ') = x'U\x') 

Hence, 

q' = x' and I#') = U\x') 

We see that the quantization of the system can be carried through by the use of 
the general commutation relations (14.63) for any pair of canonical variables that 
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can be obtained from x, p x by a continuous succession of infinitesimal transforma- 
tions. For more general canonical transformations than these, the standard quanti- 
zation procedure may or may not be valid. Clearly, it will be valid whenever the 
new operators can be obtained from the old ones by a unitary transformation. 

A simple example of a failure of the standard quantization is provided by the 
transition to spherical polar coordinates, r, (p , 8. The transformation to these from 
Cartesian coordinates is canonical, but it cannot be generated by a succession of 
infinitesimal transformations, because of the discontinuity of the spherical polar 
coordinates. Nor does a unitary transformation between x, y, z and r, <p, 8 exist, for 
if it did the eigenvalues of the latter operators would have to be the same as those 
of x, y, z, and range from — to +<», contrary to the definition of r, <p, 8. The 
general procedure for expressing the Hamiltonian operator and the Schrodinger 
equation in terms of curvilinear coordinates will be given in the Appendix, Section 3. 

Because of its close connection with the classical canonical formalism, the 
quantization procedure described here is referred to as canonical quantization. The 
correspondence between canonical transformations and unitary operators has led to 
the frequent designation of unitary operators as quantum mechanical canonical 
transformations. This terminology has asserted itself, even though some unitary 
transformations have no classical analogue, and vice versa. 

Exercise 14. 7. Show that the transformation 

UaxW = ax cos © + bp x sin 0 (14.77) 

Ubp x U f = —ax sin 0 + bp x cos 0 

is canonical, if a and b are real-valued constants, and © is a real-valued angle 
parameter. Construct the unitary operator U that effects this transformation. For the 
special case 0 = 7t/2, calculate the matrix elements of U in the coordinate repre- 
sentation. Noting that this transformation leaves the operator a 2 x 2 + b 2 p 2 invariant, 
rederive the result of Exercises 3.8 and 3.21. 

Exercise 14.8. Show that the reflection operator, defined by the relation 
U\x') — | —x'), gives rise to a unitary transformation which takes x into — x and p x 
into —p x . 

An important application of canonical transformations concerns the finding of 
constants of the motion, which are observables that commute with the Hamiltonian 
H (see Section 14.1). 

A useful way of obtaining constants of the motion for a time-independent Ham- 
iltonian operator H(q, p) consists in noting that if a (finite or infinitesimal) canonical 
transformation to new variables q', p' is made, the new Hamiltonian H' is related 
to the old one by the equation 

H(q, p) = H'(q' , p') = UH'(q, p)U f (14.78) 

which is just an extension of (14.73) to finite canonical transformations. If the ca- 
nonical transformation leaves the Hamiltonian invariant, so that the new Hamilto- 
nian H' is the same function of the canonical variables as the old one, 

H’(q,p) = H(q,p ) (14.79) 

then 


UH - HU = [U, H] = 0 
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Hence, 

4 W =^(UH- HU) = 0 

at in 

and thus the unitary operator U is a constant of the motion if the transformation 
leaves H invariant. If, in particular, H is invariant under an infinitesimal transfor- 
mation 


le 


u -- , + l G 


then the (Hermitian) generator G of the transformation commutes with H, 

[G, H] = 0 (14.80) 

and thus G is a constant of the motion. In this way, physical observables are obtained 
which are constants of the motion. 

As an example, consider a free particle whose Hamiltonian is 

H = + Py + Pb (14.81) 


According to (14.66), the generator of infinitesimal translations e, 

sG = —e x p x - s y p y - SzP z = -e • p 
produces no change in the momenta 

p' = P (14.82) 

but, owing to the fundamental commutation relations, changes the coordinates to 

r' = r — 4 [r, *> • p] = r — £ (14.83) 

in 

Thus, the transformation describes a coordinate translation, as expected from the 
connection between momentum and displacement operators. 

Evidently, any H that does not depend on the coordinates is invariant under the 
transformation (14.82), (14.83). Hence, sG = — e ■ p (for arbitrary e) and p itself 
are constants of the motion, and we conclude that linear momentum is conserved 
under these conditions. 

Similarly, for infinitesimal rotations we recall the generator 

sG=-8(j> n L (14.84) 

from Section 11.1. For a vector operator A, such a rotation induces the change 

8A = 4 [A, G] = -^4 [A, n • L] = -S</> n X A (14.85) 

in in 

by (11.19). The operator A • A = A 2 is a rotationally invariant scalar operator, as 
was shown in Section 11.1. 

The Hamiltonian for a particle in a central-force field, 

H = f - + V{r) (14.86) 

2m 

is invariant under rotations, since p 2 and r are scalar operators. Hence, 
eG — ~8<p n-L (for arbitrary vectors n and rotation angles 8(f)) and L itself are 
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constants of the motion, and orbital angular momentum is conserved for a system 
with spherical symmetry. 

Exercise 14.9. Show that if both A and B are constants of the motion, they 
either commute or the commutator i[A, B ] is also a constant of the motion. Prove 
that if the entire spectrum of H is nondegenerate, then A and B must commute. If 
the constants of the motion A and B do not commute, there must be degenerate 
energy eigenvalues. Illustrate this theorem by constructing an example Hamiltonian 
for which A = L x and B = L y are constants of the motion. 2 

5. Canonical Quantization in the Heisenberg Picture. The canonical quantiza- 
tion procedure can be formulated in any of the pictures of quantum dynamics, since 
they are all related to the Schrodinger picture by a generally time-dependent, unitary 
transformation U(t). Such a transformation leaves every algebraic relation between 
operators 

f(A, B, C, . . .) = 0 

formally unchanged (see Exercise 9.16): 

/(A, B,C,...) = f(UAU f , UBU\ UCU f , . . .) = Uf(A, B, C, . . .)U f = 0 
Hence, the canonical commutation relations for conjugate variables 

[q, p] = qp - pq = ifil 
become the same in any picture: 

[<7(0, P(t)\ = (UqU^(UpU f y - (UplTXUqU*) = ihl 
The dynamical law, 

ih = #> (14-87) 

dt 

for any operator A that does not depend on time explicitly also has the same form 
in all pictures. 


Exercise 14.10. Using Eqs. (14.41) and (14.44), verify that the equation of 
motion (14.87) for expectation values of observables holds in any picture of quantum 
dynamics. 

The equations of motion for the canonical variables are derived from (14.41): 


ih 


ih 


dq(t) 
dt 
dp(t ) 
dt 


im h 0 ] 

[pit). Ho] 


(14.88) 


and depend on the choice of H 0 , the model Hamiltonian for the particular dynamical 
picture. 


2 See Fallieros (1995). 



332 


Chapter 14 The Principles of Quantum Dynamics 


In the Heisenberg picture ( H 0 = H), the equation of motion can be expressed 
for any dynamical variable 

F(q, p, t ) s F(q(t), pit), t) 
according to (14.30) in summary form as 

(14.89) 

By (14.61), this is the quantum analogue of the classical equation of motion, 

4 P. ? ) = [ F (<h P> *). H] P S . + 4 F(q. P, 0 (14.90) 

at ot 

The formal similarity between the classical equation of motion for canonical 
variables and the quantum equations for the corresponding operators, and not merely 
their expectation values, confers a measure of distinction on the Heisenberg picture 
of quantum dynamics, which otherwise is just one of infinitely many (unitarily) 
equivalent pictures. In the Heisenberg picture, the transition from classical to quan- 
tum theory for a system that has a classical analogue is made simply by replacing 
the canonical variables by operators that can change in time, subject to the com- 
mutation relations 

Gfc(0. Pf(0] = ihs u (14.91) 

[qM, q e m = [p k (t), PM = o (14.92) 

and by postulating that if the classical equations of motion are expressed in terms 
of Poisson brackets, the correspondence 

(14.93) 

is to be made — barring, as usual, complications that may arise from ambiguities in 
the ordering of operators. 

The simple commutation relations (14.91) and (14.92) are not valid if the op- 
erators are taken at two different times. Thus, generally, q(t) and g(0) do not com- 
mute, nor is the commutator [#(0), pit)] equal to ihl. For example, if the system is 
a free particle in one dimension with H = p 2 /2m, we have 

pit) = p( 0) and q(t) = q( 0) + ^ t (14.94) 

m 

hence, 

[q(t),qm = -~t (14.95) 

m 

If this commutation relation is applied to (10.54), we find the uncertainty relation 

ft 

(A q) t ibq) 0 - Aqmm > — |t| (14.96) 

which shows that if the particle moves freely, its wave packet in the coordinate 
representation must spread in the long run as jt| -» °o. In Eq. (14.96), the notation 
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on the left-hand side emphasizes that the variances, being expectation values, are 
independent of the choice of the quantum dynamical picture. Except for denoting 
the coordinate by q rather than x, this inequality provides a precise formulation of 
the statements about wave packets contracting long before the present time and 
spreading in the distant future, that were made in Section 2.4. If the particle is free, 
initially narrow wave packets spread more rapidly than those that initially are broad. 

If the initial wave packet at t = 0 is the minimum uncertainty wave packet 
(10.66), with 


kq(0)kp = | 


then, using (14.96), 


A q(t) 


MAp 

m 


This inequality is consistent with the result of an explicit calculation of the time 
dependence of the variance of q for a wave packet that has “minimum uncertainty” 
at t = 0: 


(A*)? = (A qf Q + 


(A p?e 

2 


(14.97) 


Exercise 14.11. For a free particle in one dimension and an arbitrary initial 
wave packet, calculate the time development of (A q)j and show (as in Problem 2 in 
Chapter 3) that 

t (Ad) 2 ? 2 

(A qfi = (A?)o + - \(qp + pq) o “ 2 (q) 0 (p)] + ' 2 (14.98) 

m m 

Verify that for the minimum uncertainty wave packet this result agrees with (14.97). 
Also compare with the value of the variance (A q) 2 as a function of time for a beam 
of freely moving classical particles whose initial positions and momenta have dis- 
tributions with variances (Aq)o and (A/?) 2 . 


If the close correspondence between the classical theory and quantum dynamics 
gives the Heisenberg picture a certain preferred status, the Schrodinger picture is 
perhaps a more intuitive form of quantum mechanics. The Schrodinger picture is 
particularly suitable for a discussion of scattering processes, which are more natu- 
rally described by moving wave packets, albeit complex-valued ones, than by op- 
erators changing in time. From our present general point of view, however, it is a 
matter of taste and convenience whether we navigate in the Heisenberg or the Schro- 
dinger picture, or any other picture. Once again, the harmonic oscillator offers the 
simplest nontrivial illustration. 

In the Heisenberg picture, the oscillator Hamiltonian may, similar to the Schro- 
dinger picture form, be written as 


H — htai a^(t)a(t) + - 


(14.99) 
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Applying (14.30), we obtain the equation of motion for the lowering (annihilation) 
operator a(t), 

ih ^ 7 -^ = [a(t), H] — h(o[a(t), a(t) f ]a(t) 
at 

Using the commutation relation (10.72), transcribed into the Heisenberg picture, we 
find the simple differential equation 

+ i<oa(t) = 0 (14.100) 

at 


Although the operator equations of motion are difficult to solve directly in most 
problems, necessitating passage to a representation in which these equations become 
conventional systems of linear differential and integral equations, the present ex- 
ample is an important exception. Equation (14.100) can be solved immediately, even 
though a(t ) is an operator: 

ait) = a(t Q ) (14.101) 


Similarly, 


a + (t) = a* (t 0 ) e'“ (, -' o) 


(14.102) 


If we choose t 0 = 0, the initial values of the operators a(0) = a and u + (0) = a + 
are two mutually adjoint operators that satisfy the commutation relation (10.72). 
They are the raising (creation) and lowering (annihilation) operators in the Schro- 
dinger picture, which coalesces with the Heisenberg picture at t — 0; they serve as 
the constants of integration of the dynamical problem. The canonical variables p 
and q, and any function of these, can be expressed in terms of a and a 1 ". Thus, in 
principle the equations of motion have all been integrated. 


Exercise 14.12. Work out directly from (14.27), 

a(t ) = exp^ Ht^j a exp^ — — Hfj 

as an application of the identity (3.59). Then determine q(t) and p(t). 


We now choose a basis and introduce a fixed representation. The most conve- 
nient one is the same as in Section 10.6: A basis supported by the eigenvectors of 
H. This may be called the energy representation. The matrices representing a(t) and 
a + (f) are obtained by multiplying the matrices (10.90a) and (10.90b) representing a 
and a f by e~ lwt and e l0)t , respectively. In the energy representation of the Heisenberg 
picture, the coordinate operator of the harmonic oscillator is explicitly represented 
by the matrix: 



[<z(0 + a f (0] 


/ 0 



V\e~ io,t 

0 

V2e i0it 


0 

0 



q(t) = 


(14.103) 
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This matrix is evidently Hermitian, as it should be. Its elements are harmonic func- 
tions of time — a general property of the matrix representing any Heisenberg operator 
in an energy representation that is supported by the fixed eigenvectors of H : 

H\E) = E\E) 


Indeed, for any operator, which is not explicitly time-dependent, 


A(t) — exp — Ht A exp ~jHt 




h 


giving us the matrix element 


(E’\A(t)\E") = (E'\A\E") exp 


- (E’ - E")t 
n 


(14.104) 


The special feature of the harmonic oscillator as a perfect periodic system is that all 
matrix elements oscillate with integral multiples of the same frequency, a>. 


Exercise 14.13. In either the Heisenberg or Schrodinger picture, show that if 
at t = 0 a linear harmonic oscillator is in a coherent state, with eigenvalue a, it will 
remain in a coherent state, with eigenvalue ae~ lwt , at time t. 


6. The Forced Harmonic Oscillator. For many applications, especially in many- 
body and field theory, it is desirable to consider the dynamical effects produced by 
the addition of a time-dependent interaction that is linear in q to the Hamiltonian of 
the harmonic oscillator: 


n 2 1 

H = — + ~ ma) 2 q 2 - qQ(t) 
2m 2 


where Q(t ) is a real-valued function of t. This perturbation corresponds to an external 
time-dependent force that does not depend on the coordinate q ( dipole interaction). 
With no additional effort, we may generalize the Hamiltonian even further by intro- 
ducing a velocity-dependent term: 

2 i 

H = ^ + - mco 2 q 2 - qQ(t) ~ pP(t) (14.105) 

where P(t) is also a real function of t. 

With the substitutions (10.69) and (10.71), the Hamiltonian (14.105) may be 
cast in the form 

H = fool o' a + - ] + f(t)a + /*(0^ f 

V 2/ (14.106) 

= h(o(d\t)a(t) + ^ + f(t)d(t) + f*(t)d'(t) 


in either the Schrodinger or Heisenberg picture, provided that we define the com- 
plex-valued function f(t): 



< 2(0 + i 


hma) 


m 


m = - 


(14.107) 
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In most applications, we are interested in the lasting rather than the transient 
changes produced by the time-dependent forces in an initially unperturbed harmonic 
oscillator. It is therefore reasonable to assume that the disturbance f{t ) ± 0 acts 
only during the finite time interval T x < t < T 2 and that before T l and after T 2 the 
Hamiltonian is that of a free oscillator. The time development of the system is 
conveniently studied in the Heisenberg picture, in which the state vector I 1 ?) is 
constant, and the operators are subject to a unitary transformation as they change 
from the free oscillation regime before T x to a free oscillation regime after T 2 . 

Using the equal-time commutation relation, 

[a{t\ aft)] = 1 (14.108) 

we derive the equation of motion 

ifi — [ait), H] = hto a(t) + f*(t) 
at 


or 


da(t) i 

— ~ + i(o d(t) = -- f*(t ) (14.109) 

at n 

This inhomogeneous differential equation is easily solved by standard methods. For 
instance, it can be multiplied by e l0>t and cast in the form 

~{a(t)e iMt } = ~f*(t)e iM 

which can ther? be integrated to produce the general solution 

aft) = d(t 0 )e- ib>(t ~ - 7 f e- iw(t ~ n f*(t') dt' (14.110) 

n Jt 0 

If we choose t 0 = 0, this equation simplifies to 

d(t) = ae~ im - 7 f e- i<o(t - n f*(t') dt’ (14.111) 

n Jo 

Although it calls for unnecessarily heavy artillery, it is instructive to review 
the solution of (14.109) by the use of Green’s functions to illustrate a method that 
has proved useful in many similar but more difficult problems. 

A Green’s function appropriate to Eq. (14.109) is a solution of the equation 

dG{t ~ 1 ] + i(li G(t - t') = 8(t - t') (14.112) 

dt 

because such a function permits us to write a particular solution of Eq. (14.109) as 

d{t) - -7 [ G(t- t')f*(t') dt' (14.113) 

ft J — 00 

This is easily verified by substituting (14.113) into (14.109). Obviously, for t 4= t' 
the Green’s function is proportional to but at t = t' there is a discontinuity. 

By integrating (14.112) over an infinitesimal interval from t' — e to t' 4- e, we 
derive the condition 

lim[G(+e) - G(~e)] = 1 (14.114) 

s — >0 


for e > 0. 
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Two particular Green’s functions are useful: 

G R (t ~ t') = tj (t - t’)e~ i<0 


(14.115) 


and 


G A (t - t') = -7] it' - t)e- i,o0 ~ n 


(14.116) 


where the Heaviside step function rj(t) = 0 for t < 0 and t j(t) = 1 for t > 0 (see 
Appendix, Section 1). These two particular solutions of (14.112) are known as re- 
tarded and advanced Green’s functions, respectively. 

We note that 


ait) 



G R (t - t')f*(t') dt' 


is the particular solution of (14.109), which vanishes for t < 7\. Similarly, 


a(t) 


= -1 f 

h J- 


G A (t ~ t')f*(t') dt' 


is the particular solution of (14.109) which vanishes for t > T 2 . 

If we denote by a in (t ) and a out (t ) those solutions of the homogeneous equation 


da(t) 

dt 


+ iojait) = 0 


which coincide with the solution a(t ) of the inhomogeneous equation (14.109) for 
t < Ti and t > T 2 respectively, and if we choose t 0 = 0, we can write 


m = ajf ) - l ~ J_ + J G R (t - t')f*(t') dt' 
= a in e- iwt - l ~ e-‘“ (t - n f*(t') dt' 


(14.117) 


or, alternatively, 


a(t) a out (t) 


a nut e 


i jZ GAt ~ nf * (n dt ' 

+ L [ e - , "( f - f ')/*(t') dt' 

n Jt 


(14.118) 


Both (14.117) and (14.118) are equivalent to solution (14.111). By equating the 
right-hand sides of (14.117) and (14.118), we obtain the relation 


a, 


out 


- - g*(a>) 


(14.119) 


where 



e~ i<ot 'f(t') dt' 



e~ iw, 'fit') dt' 


(14.120) 


is the Fourier transform of the generalized force f(t). 
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The solution (14.117) or (14.118) in the Heisenberg picture can be used to 
answer all questions about the time development of the system. It is nevertheless 
instructive also to consider the dynamical problem from the point of view of other 
pictures. 

To implement the interaction picture, we regard the Hamiltonian of the forced 
oscillator as the sum, H = H 0 + V(t), of an unperturbed Hamiltonian 

H 0 = ha)(a*a + 0 (14.121) 

and an explicitly time-dependent “interaction” term, 

V(t) = f(t)a + f*(t)a f (14.122) 

Time-dependent Hamiltonians require more careful treatment than time-independent 
ones, because generally the interaction operators at two different times do not com- 
mute. 

We choose the unperturbed Hamiltonian operator H 0 as the model Hamiltonian 
to define the interaction picture. According to (14.45), the transformed interaction 
operator is 

V(t) = e i0ia ' at [f(t)a + f*(t)a r ]e- iwa ' at (14.123) 

The interaction operator can be evaluated by use of the identity (3.59), since 
[a f a, a] = —a and [a f a, a f ] = a 1 . We thus obtain 

V(t) = f(t)ae~ iwt + f*(t)a^e iwt (14.124) 

The equation of motion for the state vector in the interaction picture is 

ih y |*(f)> - {f(t)ae~ io>t + /*(0aV"'} l*(0> (14.125) 

at 

Similarly, the time development operator T(t 2 , tf) in the interaction picture as defined 
in Eq. (14,49) satisfies the equation of motion: 

ih = V(t)T(t, h) (14.126) 

at 

Integration of (14.126) over the time interval (t 1; t) and use of the initial condition 
f(t u r,) = 1 produce an integral equation for the time development operator: 

T(t, h ) = 1 ~l [ V(t')f(t', h) dt’ (14.127) 

A formal solution of this equation can be constructed by successive iteration: 

Tit , h) = 1 - l - j' V(t') dt' + ^ W) dt' V(t") dt" ± ... (14.128) 

It is sometimes convenient to write this series expansion in a more symmetric form 
by using the time-ordered product of operators. We define time ordering of two 
operators as 


T[V(t')V0")] = 


V(t')V(t") if t" < t' 
V(t")V(t') if t' < t" 


(14.129) 
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This convention is easily generalized to products of any number of time-dependent 
operators. With it we can prove that if t > t u the time development operator may 
be written in the form 


Tit, *i) = 1 + 2 f-r) f f • • • f dt[dt' 2 . . . dt' n Tm[m'2) ■ ■ • V(t'n)] 

«=i n\ \ nj Jt x Jti Jty 


or formally and compactly as 


T(t, t{) = T exp 


4j> 


) dt' 


(14.130) 


(14.131) 


To prove that (14.130) is a solution of (14.126), it is sufficient to write to first order 
in e, 

T(t + e, f0 = 1 + £ ^ 6 £ * . . j‘ dt[df 2 . . dt' n T[V(t[mt^ . . V(t'n)\ 

= T(t, h) + e £ ^ nV(t) j‘ j‘ . . j‘ dt[dt' 2 . . dfi-aTt^I)^) . . 


or 


f(r + s, tj) - T{t, tO 
s 


V(0f(t, to 
n 


which in the limit e — > 0 reduces to (14.126). 

If the formula (14.131) is applied to the forced linear harmonic oscillator, with 
the interaction potential (14.124), we obtain 


T(t, *i) = T exp-j 






(14.132) 


Although this is a compact expression for the time development operator, because 
of the presence of the unwieldy time ordering operator it is not yet in a form con- 
venient for calculating transition amplitudes. 

To derive a more manageable formula, we use the group property (14.8) and 
write 


T{t, t x ) = lim e VN e VN ~ l e VN ~ 2 . . . e V2 e Vl (14.133) 

N — >oo 


where, by definition, 

i rti + ke 

V k = V(t') dt' and Ns = t - t t (14.134) 

n Jh+(k- i) E 

Expression (14.133) is valid, even if the interaction operators at different times do 
not commute, because the time intervals of length e are infinitesimally short and are 
not subject to internal time ordering. The expression (14.133) can be further reduced 
if the commutators [E(i'), V(f")] are numbers for all t' and t". This is indeed the 
case for the forced harmonic oscillator, since according to (14.124), 

[%'), W)] = fit')f*{t")e- i<a(t '- n - f*(t l )Kt")e ,0j(t '- n 


(14.135) 
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The identity (3.61) can be applied repeatedly to give 

T(t, h ) = lim expJ ^ ( V* + X 
(*=1 \ ^ 

or, if the limit N — > °° and e -» 0 is carried out, 

T(t, h) = exp|-^ £ V(t') A' - ~ £ df £ dt"[V(t’\ %")]} (14.136) 

For the forced harmonic oscillator, inserting (14.124) and (14.135) into (14.136), 
we thus obtain the time development operator in the interaction picture in the desired 
form: 

T(t, r 3 ) = e muh) exp{ — £*(t, h)a + £(t, t^} (14.137) 

where we have defined 

f e^'f*(t f ) df (14.138) 

n Jr, 



This expression can be connected with the Fourier integral of the applied force given 
in (14.120): 


g{a>) = TO = -ih£*(- oo, +oo) (14.139) 


In (14.137), the real phase (3 appears as the price we must pay for eliminating the 
time ordering operator, and it stands for: 


m h) = £ dt' £ dt"[f(t')f *(t")e- 


iwO'-t") 


f*(f)f(t")e io>(t '~ n } (14.140) 


If the initial state at t = ti is a coherent oscillator state | a), as defined in Section 
10.7, the state at time t is 


1 ^( 0 ) = T{t,h)\a) = 

— g + or)a+ -(£* + a*)a | Q) (14.141) 

= e iyit ’ h) \ a + 0 

where y, like [3, is a numerical phase. We arrive at the intriguing and important 
conclusion that, under the influence of the (dipole) interaction f(t)a + f*(t)a f , a 
coherent state remains coherent at all times, because the time development operator 
(14.137) is a displacement operator for coherent states, like the operator D in 
(10.98). 

Of particular interest is the limit of the operator fit, tf) as t x — °° and 
t — » +oo. This limiting time development operator is known as the S (or scattering) 
operator and is defined formally as 


S = f(+ oo, -oo) 


(14.142) 


For the forced harmonic oscillator with an interaction of finite duration during the 
interval (7\, T 2 ) the S operator is 


(14.143) 
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where we have denoted 


jS = j8(+°o, -“) 

Substituting the expression for g(ca) defined in (14.120), we obtain 

[f(t)ae- iwt + f*(t)a f e im ] dtl 


S = e tp exp 


Hj: 


(14.144) 


(14.145) 


As the link between the states of the system before the onset and after the cessation 
of the interaction, the scattering operator, or the S matrix representing it, was first 
illustrated in Section 6.3. We will encounter the same concepts again in Chapter 20. 

If the oscillator is in the ground state before the start of the interaction, whafr 
is the probability that it will be found in the nth excited oscillator energy eigenstate 
after the interaction terminates? The interaction produces the state SjO), which is a 
coherent state with eigenvalue a = —(i/h)g*((o). The transition probability of find- 
ing the oscillator in the nth eigenstate after the interaction is, according to (10.1 10), 
the Poisson distribution 


|(n|5|0>| 2 = \(n\a = -ig*((o)lh ) | 2 - P n (a) 


1 g(«) 


n! 


h 


2 n 

e -\g(<o)\ 2 /h 2 



(14.146) 


with expectation value (n) = \g{(o)\ 2 lh 2 for the oscillator quantum number. These 
results can be interpreted in terms of a system of quanta, n being the number of 
quanta present. The interaction term in the Hamiltonian is linear in a 1 and a and 
creates or annihilates quanta. The strength of the interaction determines the average 
number ( n ) of quanta present and characterizes the Poisson distribution, which rep- 
resents the probability that a dipole interaction pulse incident on the vacuum state 
of our system of quanta leaves after its passage a net number of n quanta behind. 
These features of the dynamics of the forced or driven linear harmonic oscillator 
will help us understand the creation and annihilation of photons in Chapter 23. 

Finally, we use the results from the interaction picture to deduce the time de- 
velopment operator in the Schrodinger picture. From Eq. (14.49) we infer that 


T(t 2 , f,) = e- (i,h)H ^f(t 2 , tl )e m)H ^ (14.147) 


If we employ the oscillator Hamiltonian (14.121) for H 0 and the time development 
operator (14.137) in the interaction picture, we obtain 


T{t 2 , t x ) = exp ti)ae iMt2 + £(t 2 , 


(14.148) 


Exercise 14.14. If [A, B] = y B (as in Exercise 3.15) prove that 

^/(B)*? - ^ = f(e ky B) (14.149) 

Exercise 14.15. Verify the expression (14.148) for the time development op- 
erator by applying (14.149) and (3.61). 

Exercise 14.16. Show that 


S f aS = a 


i 

- g*((o) = a + a out - a in 


(14.150) 
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where S is the operator defined in Eq. (14.142) and a in and a out are related by 
(14.119). Noting that the operators a, a in , and a out differ from each other by additive 
constants, and using the unitarity of 5, deduce that 

S f a in S = a OM (14.151) 

Exercise 14.17. For a forced harmonic oscillator with a transient perturbation 
(14.106), derive the change in the unperturbed energy, if IM?) is the initial state of 
the oscillator in the interaction picture, at asymptotically long times before the onset 
of the interaction. Show that 

A E = ih)g{(o)(a ) — ia)g*(cS)(a t ) + <y|g(&))| 2 //i (14.152) 

If |''P) is the ground state of the oscillator, verify that A E given by (14.152) agrees 
with a direct calculation based on the Poisson distribution formula (14.146). 


Problems 


1. A particle of charge q moves in a uniform magnetic field B which is directed along 
the z axis. Using a gauge in which A z = 0, show that q = ( cp x — qA x )/qB and 
p — ( cp y - qA y )/c may be used as suitable canonically conjugate coordinate and 
momentum together with the pair z, p z - Derive the energy spectrum and the eigen- 
functions in the ^-representation. Discuss the remaining degeneracy. Propose alter- 
native methods for solving this eigenvalue problem. 

2. A linear harmonic oscillator is subjected to a spatially uniform external force 
F(t) — Cr](tye~ Xt where A is a positive constant and rj(t) the Heaviside step function 
(A.23). If the oscillator is in the ground state at t < 0, calculate the probability of 
finding it at time t in an oscillator eigenstate with quantum number n. Assuming 
C — ( hmX 3 ) 1 ' 2 , examine the variation of the transition probabilities with n and with 
the ratio A/w, oj being the natural frequency of the harmonic oscillator. 


3. If the term V(t) in the Hamiltonian changes suddenly (“ijnpulsively”) between time 
t and t + At, in a time A t short compared with all relevant periods, and assuming 
only that [V(t'), V(t")] = 0 during the impulse, show that the time development op- 
erator is given by 


T(t + A t, t) — exp 



Note especially that the state vector remains unchanged during a sudden change of 
V by a finite amount. 

4 . A linear harmonic oscillator in its ground state is exposed to a spatially constant force 
which at t = 0 is suddenly removed. Compute the transition probabilities to the 
excited states of the oscillator. Use the generating function for Hermite polynomials 
to obtain a general formula. How much energy is transferred? 

5. In the nuclear beta decay of a tritium atom ( 3 H) in its ground state, an electron is 
emitted and the nucleus changes into an 3 He nucleus. Assume that the change is 
sudden, and compute the probability that the atom is found in the ground state of the 
helium ion after the emission. Compute the probability of atomic excitation to the 25 
and 2 P states of the helium ion. How probable is excitation to higher levels, including 
the continuum? 
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6. A linear harmonic oscillator, with energy eigenstates |rc), is subjected to a time- 
dependent interaction between the ground state 1 0) and the first excited state: 

W) = F(i)|l)<0| + F*(r)|0)(l| 

(a) Derive the coupled equations of motion for the probability amplitudes 

(n|^(0>. 

(b) If F(t ) = \Z2ha)r)(t), obtain the energy eigenvalues and the stationary states 
for t > 0. 

s 

(c) If the system is in the ground state of the oscillator before t = 0, calculate 
for t > 0. 

7. At t < 0 a system is in a coherent state |a) (eigenstate of a) of an oscillator and 
subjected to an impulsive interaction 

V(t) = ~ £(a 2 - a f2 )8(t) 

where f is a real-valued parameter. Show that the sudden change generates a squeezed 
state. If the oscillator frequency is (o, derive the time dependence of the variances 




CHAPTER 15 


The Quantum Dynamics of a Particle 


In this chapter, we develop quantum dynamics beyond the general 
framework of Chapter 14. Generalizing what was done in Section 9.6 for 
a particle in one dimension, we summarize the three-dimensional quantum 
dynamics of a particle of mass m (and charge q) in the coordinate or 
momentum representations. In the coordinate representation, the time 
evolution of any amplitude is compactly, and for many applications 
conveniently, derived from the initial state in integral form in terms of a 
propagator (Green’s function). This view of the dynamics of a system 
leads to Feynman’s path integral method, to which a bare-boned 
introduction is given. Since most quantum systems are composed of more 
than just one particle in an external field, we show how the formalism is 
generalized to interacting compound systems and their entangled states. 
Finally, in the quantum domain it is often not possible to prepare a 
system in a definite (pure) state I'P'). Instead, it is common that our 
information is less than complete and consists of probabilities for the 
different pure states that make up a statistical ensemble representing the 
state of the system. We extend the principles of quantum dynamics to the 
density operator (or density matrix) as the proper tool for the description 
of the partial information that is generally available. From information 
theory we borrow the concept of the entropy as a quantitative measure of 
information that resides in the probability distribution. 

1. The Coordinate and Momentum Representations. We now apply the quanti- 
zation procedure of Section 14.3 to the dynamics of a particle in three dimensions. 
No really new results will be obtained, but the derivation of the familiar equations 
of Section 9.6 from the general theory will be sketched, with appropriate notational 
changes. Practice in manipulating the bra-ket formalism can be gained by filling in 
the detailed proofs, which are omitted here. 

We assume that the Hermitian operators r and p each form a complete set of 
commuting canonically conjugate observables for the system, satisfying the com- 
mutation relations (14.58) and (14.59), compactly written as 

rp - pr = ifil (15.1) 

and that the results of the measurement of these quantities can be any real number 
between — and + <». In Eq. (15.1), the bold italic 1 denotes an operator in Hilbert 
space as well as the identity matrix (dyadic) in three dimensions. 

The eigenvalues of r, denoted by r', form a real three-dimensional continuum. 
The corresponding eigenkets are denoted by Jr'), and we have 

r|r') = r'|r'> (15.2) 

with the usual normalization 

<r"[r'> - S(r' - r") (15.3) 
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in terms of the three-dimensional delta function. The closure relation, 

j Ir'Xr'IiV = 1 


(15.4) 


is the main computational device for deriving useful representation formulas. 

The (coordinate) wave function i//(r ') is the probability amplitude for a state 

|¥>: 

<K r') = <r'|¥> (15.5) 

and the momentum wave function is analogously defined as 


0(p') = <pW (15.6) 

The action of an operator A on an arbitrary state | 'T') is expressed by 

A|¥> = J J |r">(r"|A|r , )<r'|^> d 3 r" d 3 r' = J |r"><r"|A|r')iA(r') d 3 r " d 3 r' 

= 11 |p"Xp"|a|p'XpW d 3 p"dy = j | p "Xp"|a|p'>(Mp') d 3 P " dy 


(15.7) 


The matrix elements of an operator A in the coordinate or momentum representation 
are continuous rather than discrete functions of the indices. They can be calculated 
from the definition (15.2) and the commutation relations (15.1). If A = F(r, p) is 
an operator that is a function of position and momentum, care must be taken to 
maintain the ordering of noncommuting operators, such as x and p x , that may occur. 
If F(r, p) can be expressed as a power series in terms of the momentum p, we can 
transcribe the derivation in Section 9.6 to obtain 

<r"|F(r, p) | r ' > = F^r", y V"W' - r") (15.8) 


Equations (15.7) and (15.8) provide us with the necessary ingredients for rederiving 
the formulas in Section 3.2. For example, after substituting (15.8) and performing 
the r" integration, we can obtain from (15.7), 


<^|F(r, p)'|¥ fl > = j V' W') d 3 r' (15.9) 


in agreement with Eq. (3.31). 

Once we are at home in the coordinate representation, 1 the transformation 
r -» p and p -» — r, which leaves the commutation relations (15.1) invariant, may 
be used to convert any coordinate-representation formula into one that is valid in 
the momentum representation. 


Exercise 15.1. Construct the relations that are analogous to (15.8) and (15.9) 
in the momentum representation. 


‘The coordinate representation employed here is often also referred to as the Schrodinger rep- 
resentation. Since we chose to attach Schrodinger’ s name to one of the pictures of quantum dynamics, 
we avoid this terminology. 
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Exercise 15.2. Prove that 


<r'|F(r, p)|¥> = F^r'.y V r ,j<r'|¥> = F^r' 
<P'|F(r, p)|¥> = F(ihV P ’, p'Xp'I^P) = F(ihV 


, y V r .^( r') 

pS p')^(p') 


(15.10) 


Equations (15.10) can be used to translate any algebraic equation of the form 

F(r, p)|M r > = 0 (15.11) 

in the coordinate and momentum representations as 

f(t', ~ V r .^( r') = F(ihV P ’, p')^(p') = 0 (15.12) 

As an application, the eigenvalue equation for momentum, 

pIp'> = p'Ip'> 

in the coordinate representation takes the differential equation form 

fw<r’|p-> = p'(r’lp’) (15.13) 

for the momentum-coordinate transformation coefficients (r'jp'). The solution of 
this differential equation is 

<r ' | p ') = g(p')e m)p ' T ' (15.14) 

where g( p') is to be determined from a normalization condition. In (15.14) there is 
no restriction on the values of p ' other than that they must be real in order to keep 
the eigenfunctions from diverging in coordinate space. The eigenvalues are thus 
continuous, and an appropriate normalization condition requires that 

<P'| P ") = S(p\- p") = J <p"|r')<r'|p'> d 3 r' 

= g*(p'W) I e (W_p " >r ' d 3 r' = (2^) 3 |g(p')| 2 5(p' - p") 

Hence, 


g(p') = : 




(27rA) 


: 3/2 


Arbitrarily, but conveniently, the phase factor is chosen to be unity so that finally 
we arrive at the standard form of the transformation coefficients. 


<r'|p’> 


Ai/h)p'-r' 

(27 Th) 3 ' 2 


(15.15) 


These probability amplitudes linking the coordinate and momentum representations 
allow us to reestablish the connection between coordinate and momentum wave 
functions for a given state |i|j): 


<Mp') 


<p'|*> = J <p'|r')<r'|¥> d 3 r' 

= (2t Th)~ 3 ' 2 f d 3 r' r (B) P' r >( r') 


(15.16) 
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and its Fourier inverse, 

■K r') - (2 tt^)" 3/2 j d 2 p' e m)p ' r ' 4>(p') (15.17) 

These relations are, of course, familiar from Chapter 3. 

There is a fundamental arbitrariness in the definition of the basis |r') of the 
coordinate representation, which has consequences for the wave function and for the 
matrix elements bf operators. This ambiguity arises from the option to apply to all 
basis vectors a unitary transformation that merely changes the phases by an amount 
that may depend on r'. The new coordinate basis is spanned by the basis vectors 

\F) = e »(r)/*| r '> = ^<P(r'y*| r ^ (15.18) 

where cp(r) is an arbitrary scalar field. In terms of this new coordinate representation, 
the wave function for state I'P) changes from = (r'j'F) to 

(r 7 ]^) = 4f(r') = g- f <p(r'y#^r( r ') (15.1.9) 

In the new basis, the matrix elements of an operator F(r, p) are 

<P|F(r, p)|r^> = (r" [ e~ itp(r)/h F(r, p)e i(p<r),n \r') (15.20) 

V" - V"<pjs(r' - r") 

If, when calculating matrix elements in the new basis, we are to keep the rule (15.8), 
we must make the transformation 

F( r, p) -> F( r, p + V<p) (15.21) 

in order to cancel the undesirable extra term on the right-hand side of (15.20). The 
replacement (15.21) is consistent with the commutation relations for r and p, since 
the addition to the momentum p of an arbitrary irrotational vector field, 
V(r) = V<p(r), leaves the conditions (15.1) invariant. The ambiguity in the choice 
of coordinate basis is seen to be a manifestation of the gauge invariance of the 
theory, as discussed in Section 4.6 (except that here we have chosen the passive 
viewpoint). 


= (r" j F(r, p — V<p|r') - F r", y 


Exercise 15.3. Prove that the commutation relations (15.1) remain valid if the 
substitution p — » p + V(r) is made, provided that the vector field V is irrotational, 
i.e., V X V — 0. 


When we apply the rules (15.12) to the equation of motion (14.12), we obtain 
in the coordinate representation the differential equation 

ih £ <r'|¥(0> - (r'\H\nt)) = H\r', y V' j<r'|¥(0> (15.22) 

for the time-dependent wave function 

*A(r\ t) = <r'|¥(f)> (15.23) 

and similarly in the momentum representation, 


ih - (p'lTO) = <j>’\H\V(!)) = H(ihV v ., p')(p'|^(0} 

dt 


(15.24) 
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for the momentum wave function 

0(P\O = <P W0> (15.25) 

If the Hamiltonian operator H has the common form 

P 2 

H = ~n + Hr) (15.26) 

we obtain from (15.22), unsurprisingly, the time- dependent Schrodinger or wave 
equation, (3.1): 

ih Z &(■ r '> 0 = 

at 

In the momentum representation, we obtain in a similar fashion 
d /• 

- </>(p'> i) = ^ <Mp\ 0 + J (p' I Hr) |p") d 3 //' <Mp", 0 (15.28) 

The matrix element <p ' | V(r)|p") is a Fourier integral of the interaction potential. 

Exercise 15.4, By expressing (p' | V(r)|p") in the coordinate representation, 
verify the equivalence of (15.28) and (3.21). 


h 2 

— V' 2 + V(r') 
2m 


<Kr', 0 


(15.27) 


We have come full circle and have found wave mechanics in its coordinate or 
momentum version to be a realization of the general theory of quantum dynamics 
formulated in abstract vector space. The equivalence of these various forms of quan- 
tum mechanics shows again that the constant h was correctly introduced into the 
general theory in Chapter 14. Planck’s constant has the same value in all cases. 


2. The Propagator in the Coordinate Representation. If the system is a particle 

in an external electromagnetic (gauge vector) field described by the potentials 
A(r, t) and </>(r, t), the Hamiltonian operator 


H = 




(15.29) 


must be used for the transcription of the equation of motion into the coordinate 
(or momentum) representation. Here, the potential energy V includes the fourth 
component of the electromagnetic potential. Generally, the Hamiltonian is time- 
dependent. Hamiltonians for more complex systems must be constructed appropri- 
ately when the need arises. 

Since we usually choose to work in one particular dynamical picture and one 
specific representation, the cumbersome notation that distinguishes between differ- 
ent pictures and between operators and their eigenvalues (by the use of primes) can 
be dispensed with. In the Schrodinger picture and the coordinate representation, the 
equation of motion is 


ih — <A(r, 0 

at 



<A(r, t) 


(15.30) 
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In order to transcribe the equation of motion (14.11) for the time development 
operator in the coordinate representation, we define the propagator as the transition 
amplitude, 

= <r[rg,Q|r')l (15.31) 

the propagator satisfies the initial condition (for equal time t)\ 

K( r, r'; t, t ) - 5(r - r') (15.32) 

and is a nonnormalizable solution of the time-dependent Schrodinger equation 

(15.33) 

The propagator can also be identified as a (spatial) Green’s function for the time- 
dependent Schrodinger equation. 

Exercise 15.5. From the definition (15.3 1) of the propagator and the Hermitian 
character of the Hamiltonian show that 

K( r', r; t', i) = K*(r, r'; t, t') (15.34) 

linking the transition amplitude that reverses the dynamical development between 
spacetime points (r, t) and (r\ t') to the original propagator. 

From its definition (15.31) and the composition rule (14.3) we see that the 
propagator relates the Schrodinger wave functions at two different times: 

(15.35) 

This relation justifies the name propagator for K. Equation (15.35) can also be read 
as an integral equation for the wave function; K is its kernel. 

Exercise 15.6. If a gauge transformation is performed, as in Section 4.6, what 
happens to the propagator? Derive its transformation property. 

The retarded Green’s function, defined as G R = 0 for t < t' , is related to the 
propagator by 

G R (r, f, r', t') = -j 7} (t - t')K{ r, r'; t , t') (15.36) 

n 

and satisfies the inhomogeneous equation 

ih — = H(t)G R + S ( r - r')S(t - t') (15.37) 

dt 

The propagator and the retarded Green’s function are at the center of Feynman’s 
formulation of quantum mechanics, to which a brief introduction will be given in 
Section 15.3. 
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Exercise 15.7. Prove Eq. (15.37). 


The corresponding advanced Green’s function is defined as 

G a (t, t; r\ t r ) = ~ rj(t' - t)K(r, r'; t, t') (15.38) 

n 

It satisfies the same equation (15.37) as the retarded Green’s function. Since for the 
Heaviside step function: 

y(t) + y(-t) = 1 (15.39) 

we note that the two Green’s functions, which solve the inhomogeneous equation 
(15.37), are connected with the propagator (which is a solution of the Schrodinger 
equation without the inhomogeneous term) by 

G R -G A =-~K (15.40) 


If the Hamiltonian is time-independent, the time development operator is, explicitly, 


Tit, t') = exp 


--m-n 


(15.41) 


and the propagator (15.31) then depends on the times t and t' only through their 
difference t — t'. 


K(r, r'; t - t') = £ <r| 


exp 


h H{t ~ ° 


E n ){E n |r'> 


where 


H\E n ) = E n | E n ) 

Hence, in terms of the energy eigenfunctions, explicitly/ 


K(r, r f - #') = E ( r ')<A„ (r) exp 

n 




(15.42) 


(15.43) 


The sum must be extended over the complete set of stationary states, including the 
degenerate ones. 

If in the Hamiltonian (15.29) the vector potential A = 0 and V(r) is static (or 
an even function of t ), the equation of motion is invariant under time reversal, as 
discussed in Section 3.5. In this case K*(r, r t' — t) is also a solution of the time- 
dependent Schrodinger equation (15.33). Since the initial condition (15.32) selects 
a unique solution, we must have, owing to time reversal symmetry, 

K*(r, r';t' ~ t) = K(r, r'; t - t') (15.44) 

Comparison of this result with the general property (15.34) of the propagator shows 
that under these quite common circumstances the propagator is symmetric with re- 
spect to the space coordinates: 


K(r, r';t - t’) = Kir', r; t ~ t') 


(15.45) 
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The simplest and most important example of a propagator is the one for the free 
particle. In one spatial dimension, 

T(t, t') = exp r')j (15.46) 

and, applying (15.43), in its integral form suitable for the continuous momentum 
variable, 

/*+ oo 

K(x, x'; t - t') = e -(i/*-)Pp-t’)/2m e -(M)p x ( X - X ’) dpx (15.47) 

This Fourier integral can be performed explicitly: 

(15.48) 



Exercise 15.8. Verify that (15.48) solves the time-dependent Schrodinger 
equation and agrees with the initial condition (15.32). If the initial ( t ' = 0) state of 
a free particle is represented by ^(x, 0) = e lkoX , verify that (15.35) produces the usual 
plane wave. 


If the system is initially (at t’ — 0) represented by the minimum uncertainty 
wave packet [see (10.66)] 

ip(x, 0) = [2 tt(Ax)o] ~ 1/4 exp | - + *&o*J (15.49) 

we have, by substituting (15.48) and (15.49) into (15.35), 



Exercise 15.9. Calculate | ip(x, t) | 2 from (15.50) and show that the wave packet 
moves uniformly and at the same time spreads so that 

(A*)? = (A*)5[l + <15.51) 

All these results for free-particle dynamics are in agreement with the prelimi- 
nary conclusions reached in Section 2.4, in Problem 1 in Chapter 2 and Problem 2 
in Chapter 3, and in Eq. (14.98). The generalization of the propagator formalism to 
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the motion of a free particle in three dimensions is straightforward, as long as Car- 
tesian coordinates are used. 

The linear harmonic oscillator is a second example. Setting t' — 0 for conve- 
nience, the Green’s function solution of the equation 


dK(x, x'\t) _ h 2 d 2 K(x, x'\ t) 1 
dt 2m dx 2 2 


+ - moy 2 K{x, x'\ t) 


(15.52) 


which satisfies the initial condition (15.32), can be calculated by many different 
routes. 

For instance, we may go back to the method developed in Sections 2.5 and 3.1 
and write the propagator in terms of a function S(x, x'\ t ): 


K(x, x'\ t) — e iS(xy;m 


(15.53) 


such that S satisfies the quantum mechanical Hamilton- Jacobi equation (3.2), 


dS 1 fdS\ ih d 2 S 

_ + _ _ _ + V(x) = 0 

dt 2m \dxl 2m dx 2 


(15.54) 


where for the linear harmonic oscillator V —mofx 2 / 2. Since this potential is an even 
function of the coordinate and since initially, 


K(~x, -x'; 0) - K(x, x'; 0) 


it follows that for all t. 


S(x, x'\ t ) = S(—x,— x'\ t ) = S{x ' , x\ t) — —S*(x, x'; —t) 


(15.55) 


(15.56) 


where the symmetry relations (15.34) and (15.45) have been employed. Combining 
all this information and attempting to solve (15.54) by a series in powers of x and 
x' , we conjecture that S must have the form 


S(x, x'\ t ) = a(t)x 2 + b(t)xx' + a{t)x' 2 + c(t) 


(15.57) 


Substitution of this Ansatz into (15.54) yields the coupled ordinary differential equa- 
tions, 


a = a — 


1 b 2 , 2 ih 

— r = — — , b = ah, c — — a (15.58) 

m 2 mo 2m m m 


All of these requirements, including especially the initial condition (15.32), can 
be satisfied only if (15.58) is solved by 


, . 1 i \ m(0 

a(t) = — moo cot cut, b{t) — , 

2 sin (ot 

giving finally the result 


c(t) = — log 


ih . ( 2irih 


sin (Dt 


K(x, x'; t) = 


2 mh sin cot 


2 ih sin cot 


(x 2 COS (Dt — 2xx' + x' 2 COS (Dt 


(15.59) 


In the limit co — » 0, the propagator (15.59) for the oscillator reduces to the free- 
particle propagator (15.48). 
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Exercise 15.10. Applying Mehler’s formula (5.46) to the stationary state ex- 
pansion (15.43) of the propagator, verify the result (15.59) for the linear harmonic 
oscillator. Conversely, show that (15.59) has the form 

e~ ito,/2 f(e~ itot ) 

and deduce the energy eigenvalues of the oscillator by comparing with (15.43). 


Exercise 15.11. Show that if the initial state of a harmonic oscillator is rep- 
resented by the displaced ground state wave function 


if/(x, 0 ) = N expi 


mco 

2 h 


( x ~ x 0 f 


(15.60) 


the state at time t is 


if/(x, t) — N exp-j 


mco 


2 h 


(x — x G cos cot) 2 — i 


(O 


mco . mco 2 . 

-I — — XqX sm cot — — x 5 sm 2 cot 
ft 4 ft > 


(15.61) 


Show that 1 1 jj(x, t) \ oscillates without any change of shape. 


Although, in principle, the propagator (15.59) answers all questions about the 
dynamics of the harmonic oscillator, for many applications, especially in quantum 
optics, it is desirable to express the time development of the oscillator in terms of 
coherent states. This was done in Section 14.6 in the interaction picture for the forced 
or driven harmonic oscillator. Here we revert to the Schrodinger picture. 

We know from Eq. (10.122) that the displaced oscillator ground state wave 
functio n (15.6 0) is the coordinate representative of a coherent state |a), with 
x 0 = V2 hi mco a. An initial state 

|W)) = |a) X (15.62) 


develops in time under the action of the free, unforced oscillator Hamiltonian in the 
Schrodinger picture (see Exercise 14.13) as 

|¥(f)> - e~ ioiiaU+imt \a) = e - io>t,2 \ ae~ iu,t ) (15.63) 

In words: If we represent a coherent state | a) by its complex eigenvalue a as a 
vector in the complex a plane (Figure 10.1), the time development of the oscillator 
is represented by a uniform clockwise rotation of the vector with angular velocity 
co. Since 



(15.64) 


the complex a plane can be interpreted as a quantum mechanical analogue of clas- 
sical phase space. The expectation values (x) and (p) perform harmonic oscillations, 
as dictated by classical mechanics. 


Exercise 15.12. Using (10.122), show that except for a normalization factor 
the amplitude (xj^(O) calculated from (15.63) again yields the oscillating wave 
packet (15.61). Relate the normalization factors for the two expressions. 


Exercise 15.13. For the harmonic oscillator, derive cp(x, t) directly from 
i//(x, 0) by expanding the initial wave function, which represents a displaced ground 



354 


Chapter 15 The Quantum Dynamics of a Particle 


state as in (15.60), in terms of stationary states. Use the generating function (5.33) 
to obtain the expansion coefficients and again to sum the expansion. Rederive 
(15.61). 

Exercise 15.14. From (15.64) and its time development, derive the expectation 
values ( x) t and ( p), in terms of their initial values. 

The forced or driven harmonic oscillator represents the next stage in complexity 
:>f an important dynamical system. If the Hamiltonian has the form 

rf \ 

H — 1 — mw 2 x 2 — xFft ) (15.65) 

2m 2 

where F(t ) stands for an arbitrary driving force, the propagator may be evaluated 
from the time development operator derived, in the interaction picture, in Section 
14.6. Alternatively, it may be obtained by extending the solution of the quantum 
mechanical Hamilton- Jacobi equation to the forced oscillator, adding the time- 
lependent interaction term —xFft) to the potential energy V(x) in (15.54). It is then 
Dossible (but tedious) to show that when the propagator is expressed in terms of the 
function S(x, t) as in (15.53), the result (15.57) for the free oscillator can still be 
ased, but it must be augmented by an interaction-dependent correction that is linear 
m the coordinates and has the form: 

s int (x, x'\ t, t') = fit, t')x - fit', t)x' + git, t') (15.66) 

We only quote the results of the calculation: 2 

fit, t') = 7 f Fit") sin wit" - t') dt" (15.67) 

sin wft - t ) jf 

ind 

dt" r" 

git, t') = — — — — dt x Fif) dt 2 F(t 2 ) sin (bif - t ') sin <oit 2 - t') 

Jr sin wit - t ) Jr Jf 

(15.68) 

All the quantum mechanical functions S that we have calculated so far are 
essentially the same as Hamilton’s Principal Function in classical mechanics, except 
'or a purely time-dependent term eft) in (15.57) and (15.58). This latter term arises 
from the presence of the term proportional to h (occasionally called the quantum 
potential) in the quantum mechanical Hamilton- Jacobi equation. It is responsible 
? or the correct normalization of the propagator. This very close connection between 
fiassical and quantum mechanics is contingent on the simple form of the interaction 
potential as a polynomial of second degree in x, as in the generalized parametric 
md driven harmonic oscillator. The addition of anharmonic terms to the interaction 
complicates matters and makes solving the Hamilton-Jacobi equation more difficult, 
rhe resulting 5(x, t) will generally exhibit more distinctive quantum effects. 

The propagator formalism and its expression in terms of the action function 
? provides a natural entree to Feynman’s path integral formulation of quantum 
lynamics. 


2 See Feynman and Hibbs (1965), Chapter 3. 
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3. Feynman's Path Integral Formulation of Quantum Dynamics. We saw in 
the last section that if the potential energy depends on the coordinate x only through 
terms that are linear or at most quadratic in x, the x and x' dependence of the 
propagator for a transition from spacetime (x\ t') to (x, t ) is entirely contained in a 
real- valued phase function, which is Hamilton’s (classical) Principal Function S for 
the motion between these two spacetime points. Here we use S (in roman font) to 
denote the classical function to distinguish it from its quantum mechanical counter- 
part, S (in italics). In this section, we limit ourselves to the motion of a particle in 
one dimension. 

From classical mechanics 3 it is known that S(x, x'\ t, t') is the stationary value 
of the classical action function 

I(x, x'; t, t') = | L(x(t"), x(t"), t") dt" (15.69) 

I 

where 

L(x(t), x(t), t) = - mx 2 — L(x, t) (15.70) 

is the classical Lagrangian function for the simple one-dimensional one-particle sys- 
tem that we are considering. Hamilton’s Principle for the variation of the action, 

8I(x, x'; t, t') = 8 L(x(t"), x(t"), t') dt" = 0 (15.71) 

singles out the motion x(£) that takes the particle from the initial spacetime point 
(x\ t') to the final destination (x, t ). Thus, Hamilton’s Principal Function is 

S(x, x'; t, t') = j' L(x(t"), x(t"), t") dt" (15.72) 

where it is now understood that x(t) is the correct classical motion connecting the 
two endpoints. 

Exercise 15.15. For a particle moving from spacetime point (x', t') to (x, t) 
with the classical Lagrangian L = (1/2 )mx 2 , show that S, derived from Hamilton’s 
Principle, reproduces the exponent in the free particle propagator (15.48). 

Exercise 15.16. For a particle moving from spacetime point (x\ t ') to (x, t) 
with the classical Lagrangian L = (1/2 )mx 2 - (1/2 )mo) 2 x 2 , show that S, derived 
from Hamilton’s Principle, reproduces the exponent in the harmonic-oscillator prop- 
agator (15.59). 

To derive the Feynman path integral expression for the propagator in quantum 
mechanics, we first observe that owing to the fundamental group property (14.8) of 
the time development, the propagator satisfies the composition rule: 

K(x, x'; t, t') = f K(x, x"; t, t")K{x", x'\ t", t') dx" 

J , (15.73) 

= (x| T(t, t')|x') = (x| T(t, t")\x")(x"\T(t", t')\x') dx" 


3 Goldstein (1980), Section 10.1. 
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for any value of the time t". In order to utilize a simple approximation, we partition 
the time interval ( t t) into N infinitesimally short intervals of duration e. When 
this is done, the composition rule generalizes to 

K(x, x'; t, t') = ... dx N - x . . . dx l K(x, x N -p, t, t — e) . . . 

J J (15.74) 

K(x 2 , Xi, t' + 2s, t' + e)K(x u x'\ t' + s, t') 

The construction of this expression implies that the x integrations are to be per- 
formed as soon as any two successive propagators are multiplied. Equivalently, how- 
ever, we may first multiply the N propagators, leaving the integrations to the end. 
Since each of the coordinates x ; ranges from — °° to + °°, the composition of the N 
propagators may then be construed as a sum over infinitely many different paths 
from the initial spacetime point (x', t') to the final spacetime point (x, t), as indicated 
schematically in Figure 15.1. The propagator for each infinitesimal time interval is 
now approximated by assuming that the motion of the particle from ( x n - x , t n _i) to 
(. x n , t„ ) is governed by a potential that is at most a second-degree polynomial in x. 
From Section 15.2, we know that in this approximation the propagator that takes us 
from (x„_!, t„_i) to ( x n , t„) is in the form 

K(x n ,x n - 1 ; e) - C(t n . u tn)e (.ms( XnXMn _o (1 5.75) 

If we multiply the N elementary propagators for a particular “path” in spacetime 
together in the integrand of (15.74) and take the limit s — » 0 and IV — > °o, the 
additivity of the action function shows that each path contributes, in units of Planck’s 
constant h, a real phase 

S[x(t)] = £ L(x(t"), x(t"), t") df (15.76) 

Although all quantities in (15.76) are classical functions of coordinates and of time, 
the path x(t) that takes the particle from the initial spacetime point (x', t') to the 
final destination (x, t) now is generally not the actual classical motion x(t) that 
Hamilton’s Principle selects. 



Figure 15.1 Paths linking the initial spacetime point (x',t') to the final spacetime point 
(x,t). The smooth curve represents the classical path x(r) for the particle motion in the 
potential V(x). The curve composed of broken straight segments is a typical path that 

makes a contribution exp|— S[x(t)]| to the Feynman path integral. 
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Substituting the results from (15.75) and (15.76) into the composition rule 
(15.74), we finally arrive at Feynman’s path integral formula for the propagator, 


( x , t \x' , t') — K{ x, x'; t, t') = 


c E « 

all paths 


(l/ft)SWf)] 


= c 

J 


(15.77) 


where the factor ‘C, which is independent of the coordinates, arises from the product 
of the time-dependent factors C(t n - 1 , t n ) in the propagators (15.75) and is attrib- 
utable to the term proportional to ih in the quantum mechanical Hamilton-Jacobi 
equation. In the last expression on the right-hand side of (15.77), the differential 
Z)[x(t)] is intended to remind us that the propagator is a functional integral , in which 
the variable of integration is the function x(f). To evaluate such an integral, which 
is the limit of the Sum over all paths sketched in Figure 15.1, it is obviously nec- 
essary to extend the concepts of mathematical analysis beyond the standard reper- 
toire and define an appropriate measure and a suitable parametrization in the space 
of possible paths. 4 

The derivation of (15.77) given here is a bit cavalier, but it captures the essence 
of the argument and produces correct results. To prove this, one can show that those 
contorted paths that are not accurately represented by the approximation (15.75) for 
the individual path segments contribute negligibly to the sum over all paths in 
(15.77), due to destructive interference caused by extremely rapid phase variations 
between neighboring paths. 

Although it is in general a difficult mathematical problem, the integration over 
paths reduces in many applications effectively to the sum of contributions from only 
a few isolated paths. The stationary phase method, which in effect was already used 
in Chapter 2 for obtaining approximate wave functions, is a useful tool for evaluating 
the propagator by the Feynman path integral method. The actual classical spacetime 
path x(t) that connects the initial and final spacetime points, (*', t') and (x, t), 
corresponds, according to Hamilton’s Principle, to the stationary phase in the path 
integral (15.77). The neighboring paths add constructively, and a first (semiclassical) 
approximation for the propagator is therefore 


K(x, x'\ t, t') « C{t, t') e (ms[xm 


(15.78) 


We saw in Section 15.2 that this formula is not just an approximation, but is exact 
for a large class of problems, including the free particle and the harmonic oscillator, 
even with an arbitrary linear driving term. 

In this chapter we have confined ourselves to describing the path integral for- 
mulation of quantum dynamics for the simple case of a nonrelativistic particle in 
one dimension, but the Feynman method is quite general. 5 For all systems that can 
be quantized by either method, it is equivalent to the canonical form of quantum 
mechanics, developed in Chapter 14, but the path integral approach offers a road to 
quantum mechanics for systems that are not readily accessible via Hamiltonian 
mechanics. 


4 Schulman (1981). 

5 For an excellent discussion of interference and diffraction of particle states in relation to path 
integrals, see Townsend (1992), Chapter 8. 
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4. Quantum Dynamics in Direct Product Spaces and Multiparticle Systems. Of- 
ten the state vector space of a system can be regarded as the direct, outer, or tensor 
product of vector spaces for simpler subsystems. The direct product space is formed 
from two independent unrelated vector spaces that are respectively spanned by the 
basis vectors ]AJ) and \B’f) by constructing the basis vectors 

I A[B' 2 ) = \A[) ® | B' 2 ) = \A[)\B' 2 ) (15.79) 

Although the symbol ® is the accepted mathematical notation for the direct product 
of state vectors, it is usually dispensed with in the physics literature, and we adopt 
this practice when it is unlikely to lead to misunderstandings. If n t and n 2 are the 
dimensions of the two factor spaces, the product space has dimension n x X n 2 . This 
idea is easily extended to the construction of direct product spaces from three or 
more simple spaces. 

The most immediate example of a direct product space is the state vector space 
for a particle that is characterized by its position r(x, y, z). The basis vector 
|r) = |.x, y, z ) may be expressed as the direct product |x) ® \y) ® |z) = |x)|y)|z), 
since the three Cartesian coordinates can be chosen independently to specify the 
location of the particle. (On the other hand, the Euclidean three-space with basis 
vectors x, y, z, is the sum and not the product of the three one-dimensional spaces 
supported by x and y and z.) 

Any operator that pertains to only one of the factor spaces is regarded as acting 
as an identity operator with respect to the other factor spaces. More generally, if M l 
and N 2 are two linear operators belonging to the vector spaces 1 and 2 such that 

MM'i) = 2 lA'lXA'llMM'i) 

A'j 

and 

N 2 \B' 2 ) = 2 \Bl){B'i\N 2 \B’ 2 ) 

B 2 

we define the direct or tensor product operator Mi ® N 2 by the equation 

M l ® N 2 \A[B’ 2 ) = £ \A'[B'f){A"\Mi \A'i)(B 2 \N 2 \B 2 ) (15.80) 

a 1< b 2 

Hence, A/j ® N 2 is represented by a matrix that is said to be the direct product of 
the two matrices representing M x and N 2 separately and that is defined by 


<A”B” 2 \M x <g) N 2 \A\B' 2 ) = (A'[\Mi\A[}(B'i\N 2 \B^) 


(15.81) 


Exercise 15.17. If Mj and P x are operators in space 1 and N 2 and Q 2 are 
operators in space 2, prove the identity 

Mi P x ® N 2 Q 2 = {Mi ® N 2 ){P x ® Q 2 ) (15.82) 

Check this identity for the corresponding matrices. 

We are now prepared to generalize the formalism of one-particle quantum me- 
chanics unambiguously to systems composed of several particles. If the particles are 
identical, very important peculiarities require consideration. Since Chapter 21 deals 
with these exclusively, we confine ourselves in this section to the quantum mechan- 
ics of systems with distinguishable particles. Furthermore, to make things clear, it 
is sufficient to restrict the discussion to systems containing just two particles. Ex- 
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amples are the ordinary hydrogenic atom or the muonium atom, including the dy- 
namics of the nucleus, the deuteron composed of a proton and a neutron, and the 
positronium (electron and positron) atom. 

We denote the two particles by the subscripts 1 and 2. As long as the spin can 
be ignored, six spatial coordinates are used to define the basis Ir^) = | Ti) | r 2 ) for 
the two-particle system. In analogy to (15.5), we introduce the two-particle wave 
function 

a 

*A( r i, r 2 ) — (r 1 r 2 |^ r ) (15.83) 

The interpretation of this probability amplitude is the usual one: | fi(r x , r 2 )| 2 d 3 r± d 3 r 2 
is proportional to the probability that particle 1 is found in volume element d 3 r[ 
centered at and simultaneously particle 2 in volume element d 3 r 2 centered at r 2 . 
If r 2 ) is quadratically integrable, we usually assume the normalization 

I 

J j \ip(r u r 2 )| 2 d 3 r x d 3 r 2 = 1 (15.84) 


Since ip is now a function of two different points in space, it can no longer be 
pictured as a wave in the naive sense that we found so fruitful in the early chapters. 
Instead, ip for two particles may be regarded as a wave in a six-dimensional config- 
uration space of the coordinates and r 2 . 

The Hamiltonian of the two-particle system (without spin and without external 
forces) is taken over from classical mechanics and has the general form 

" A + S + V(kl “ r2,) (15 - 85) 


In the coordinate representation, this leads to the Schrodinger equation 

< K**i, r 2 ) = Etp(r u r 2 ) (15.86) 


h 2 h 2 

V?-^-V 2 + .Vfl ri -r 2 |) 


j 2m i 2 m 2 

in configuration space. It is easily verified that the substitutions 


r = ij — r 2 


R _ m l r l + m 2 r 2 
m x + m 2 


(15.87) 


transform the Schrodinger equation to 


&_(#_ A A , _ A 

2 M \dX 2 + dY 2 + dZ 2 y 2 m r 



ip + V(r)ip = Efi 


(15.88) 


where now — if/( r, R) is a function of the relative coordinate r(x, y, z) and the 
coordinate of the center of mass R(X, Y, Z). In this equation, M = m x + m 2 is the 
total mass, and m r — m 1 m 2 /(m x + m 2 ) is the reduced mass of the system. 

The new Hamiltonian is a sum 


H = H R + H r (15.89) 

and each of the two sub-Hamiltonians possesses a complete set of eigenfunctions. 
Hence, all the eigenfunctions of (15.88) can be obtained by assuming that ip is a 
product 


«A(r, R) = <Kr)<p(R) 


(15.90) 
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and the energy a sum, E — E R + E r , such that 


h ?_ 
2 M 


Vlcp(R) - E R <p{ R) 


(15.91) 


and 


ti 2 

2 m r 


V^(r) + V(r)iff(r) = Epp(r) 


(15.92) 


As anticipated, the relative motion of a system of two particles subject to central 
forces can be treated like a one-particle problem if the reduced mass is used. This 
justifies the simple reduced one-particle treatment of the diatomic molecule (Section 
8.6) and the hydrogen atom (Chapter 12). As pointed out earlier, the most conspic- 
uous manifestation of the reduced mass is the shift that is observed in a comparison 
of the spectral lines of hydrogen, deuterium, positronium, muonium, and so on. 
Equation (15.91), whose solutions are plane waves, represents the quantum form of 
Newton’s first law: the total momentum of an isolated system is constant. 

The canonical transformation (15.87) could equally well have been made before 
the quantization. We note that the linear momenta are transformed according to 


m 2 m, 

p = m r (Vi — v?) = — pi p? 

F M FI M F2 

P = Pi + p 2 

(15.93) 

The kinetic energy takes the form 



II 

3 

SI* 


(15.94) 

and the orbital angular momentum of the system becomes 


L = r, X p : + r 2 X p 2 = r X p 

+ R X P 

(15.95) 

If the Hamiltonian is expressed as 



P 2 p 2 

H = — + -E— + V(r) 
2 M 2m,. V ' 


(15.96) 


subsequent quantization and use of the coordinate representation lead again to 
(15.88). 


Exercise 15.18. Prove that r, p and R, P defined in (15.87) and (15.93) satisfy 
the commutation relations for conjugate canonical variables. Also show that the 
Jacobian of the transformation from coordinates r l5 r 2 and r, R is unity. 

It is interesting to ask whether the wave function for a two-particle system is 
factorable, or separable, and can be written as the product of a function that depends 
only on the coordinates of particle 1 and a function that depends only on particle 2, 
such that 

<K r i, r 2 ) = (15.97) 

Obviously, such states are particularly simple to interpret, since we can say that in 
these cases the two particles are described by their own independent probability 
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amplitudes. This is sometimes expressed by saying that states like (15.97) do not 
exhibit correlations between the two particles. The fundamental coordinate basis 
states |r!r 2 ) = |r 2 )|r 2 ) have this special character. 

Exercise 15.19. Show that the state of two particles with sharp momenta pj 
and p 2 , corresponding to the plane wave function 

eX pfj£ P2 ‘ r2 ) (15.98) 

is also separable when it is transformed by use of (15.87) and (15.93) into i//(r, R). 

Most two-particle states are not factorable like (15.97). Except for the special 
case (15.98), wave functions of the type (15.90), which are factorable in relative 
coordinates, are generally not separable with regard to the two particles and are said 
to be correlated. An extreme example of a correlated wave function is afforded by 
the simple model of two particles confined to staying on the x axis and represented 
by an idealized amplitude: 

4*{x i, x 2 ) = «Kx, X) = if/(x)(p(X) = (2t^)~ 1/2 8(x x - x 2 — a) — (27 rh)~ y2 8{x — a) 

(15.99) 

Here a is a positive constant, which may be chosen as large as we please. The state 
represented by (15.99) corresponds to the two particles being separated precisely 
and invariably by the distance a, but the probability of finding one of the particles, 
say particle 1, anywhere regardless of the position of particle 2 is constant and 
independent of the position x x . Once a measurement shows particle 2 to be located 
at coordinate x 2 , then particle 1 is certain to be found at position x x — x 2 + a, and 
nowhere else. The wave function ip(x) = 8(x — a ) describes the relative motion of 
the two particles, and <p(X) = s/l/lirh is a momentum eigenstate of the center- 
of-mass motion, corresponding to zero total momentum. 

Since for any function i//(x x — x 2 ), 

h ( d d \ 

T ( — + — «K*i - x 2 ) = 0 
i ydxj dx 2 / 

we see that (15.99) is an eigenstate of the total momentum, corresponding to eigen- 
value zero. Hence, if the momentum of particle 2 is measured and found to have the 
value p 2> then particle 1 is certain to be found to have the sharp momentum value 
Pi = ~P2- 

Thus, depending on whether the coordinate or the momentum of particle 2 is 
measured, we are led to conclude that particle 1 after such a measurement is rep- 
resented by, a one-particle state of sharp position (delta function in coordinate space, 
centered at x x = x 2 + a) or sharp momentum (plane wave with momentum — p 2 ). 

In their famous 1935 paper, Einstein, Podolsky, and Rosen articulated the dis- 
tress that many physicists felt— and occasionally still feel — about these unequivocal 
conclusions of quantum mechanics. If we assume that the quantum mechanical am- 
plitude gives a complete (statistical) account of the behavior of a single system, it 
appears that, even when the two particles are arbitrarily far apart (large a), what can 
be known about the state of particle 1, after a measurement on particle 2 is under- 
taken, depends on the choice of measurement made on particle 2, such as a coor- 
dinate or a momentum measurement. These mysterious long-range correlations be- 
tween the two widely separated particles and the strange dependence of the expected 


00*1. r 2 ) = expl - pn • 1-j 
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behavior of particle 1 on the subjective fickleness of a distant human experimenter, 
who has no means of interacting with particle 1, seemed to Einstein to signal a 
violation of the innate sense that the world of physics is governed by local realism. 

Einstein tried to resolve this conflict by suggesting that quantum mechanical 
amplitudes pertain only to ensembles of systems, rather than single systems, and 
provide a correct but incomplete description of physical reality. In principle, a more 
complete theory, consistent with quantum mechanics, might thus be eventually dis- 
covered. But John S. Bell showed that such a program cannot be carried out, as long 
as the theory is required to be local, that is, not afflicted with unaccountable actions- 
at-a-distance between measuring devices. Any theory built on strict local realism 
fails to reproduce some predictions of multiparticle quantum mechanics — predic- 
tions that have been verified experimentally to a high degree of accuracy. The quest 
for a return to local realism in physics must thus remain unfulfilled, and we have to 
accept the existence of quantum correlations between widely separated subsystems. 
Furthermore, we persist in interpreting the formalism of quantum mechanics as pro- 
viding complete statistical predictions of the behavior of single systems. 

Exercise 15.20. An alternative representation of two-particle states is given 
in terms of the “mixed” basis states, | r, P), where r is the relative coordinate vector 
and P the total momentum. By using the intermediate coordinate basis j r, R), derive 
the transformation coefficients {r^lr, P). For the correlated state I'P) represented 
by (15.99), show that the wave function in the mixed relative representation is 

(x, P|^> = S(x ~ a)8(P ) (15.100) 

Correlated amplitudes like (15.99) or (15.100), which cannot be factored with 
regard to the two subsystems 1 and 2, are sometimes called entangled, a term coined 
by Schrodinger and illustrated dramatically in his famous cat allegory. Using the 
basis states (15.79), it is not difficult to construct examples of entangled states for 
a system composed of two independent subsystems. A general state may be expanded 
as 

\V) = 2 |AJ(*» ® (15.101) 

k,e 

where for typographic clarity the quantum number labels, k and €, for the basis 
states of the separate subsystems have been placed as arguments in parentheses 
instead of as the usual subscripts. The necessary and sufficient condition for the state 
of the composite system to be factorable with respect to particles 1 and 2 is that the 
n 1 X n 2 dimensional rectangular matrix of the amplitudes be ex- 

pressible in terms of n x + n 2 complex numbers as 

Ml)\ 

fl(2) ® W) K 2) • • b(n 2 )) (15.102) 

\a(rii )/ 

The state is entangled if and only if the amplitudes cannot be expressed in the form 
(15. 102). 6 

6 For a reprint compilation of the key historical papers on entangled states and the puzzling ques- 
tions they have raised, see Wheeler and Zurek (1983). 
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Exercise 15.21. Check that the amplitude (15.99) is entangled by making a 
(Fourier) expansion in terms of momentum eigenfunctions or any other complete set 
of one-particle basis functions. 

5 . The Density Operator, the Density Matrix, Measurement, and Information. 
The density operator for a state 1^(0) was defined in Eq. (14.19), and its time 
development was considered in Sections 14.1 and 14.2. As a projection operator for 
a state | 'F), the d’ensity operator 

p = |^><^| (15.103) 

contains all relevant information about the state. The density operator is idempotent, 
since owing to normalization, 

p 2 = I^X^I^X^I = I^X^I = P (15.104) 

Except for an irrelevant phase factor, the state | M*) can be recovered from the density 
operator as the eigenvector of p which corresponds to eigenvalue 1 . All expectation 
values can be expressed in terms of the density operator, as can be seen from 

<¥|A|¥> = 2 <^|^X^|A|^) = 2 = trac e(Ap) (15.105) 

i i 

For A — 1, this formula is the normalization condition 

0P|¥> = trace p = 1 (15.106) 

If (15.105) is applied to the projection operator A = | <£)(<!> | = P< t» an expression 
for a probability in terms of the density operator results: 

p* = |<<J>|¥>| 2 = <¥ | OX® I ¥> = = trace(|$)($|p) = trac e(P*p) 

(15.107) 

Since this can also be written as 

p<j> = |<0>|'F>| 2 = <<&|^X^|«>> = <9>|p|3>> (15.108) 

we infer that p is a positive Hermitian operator and, in particular, that the diagonal 
elements of any density matrix are nonnegative. 

The probabilities p* and 1 - p& are associated with the outcomes of measuring 
the positive operators P® and 1 — P <j>. Somewhat imprecisely we say that p® is the 
probability of finding the system in state | d>}, and 1 — p® is the probability of finding 
the system not to be in state |d>). 

Building on the foundations laid in chapters 4, 9, and 10 — especially Sections 
4.2 and 10.4 — we characterize a complete orthonormal quantum measurement by 
considering a set of n mutually orthogonal, and hence commuting, rank-one (or one- 
dimensional) projection operators P u P 2 , . . . P„ (where n is the dimensionality of 
the state vector, or Hilbert, space of the system). The completeness is expressed by 
the closure relation (10.44): 

n 

2 P t = 1 (15.109) 

e=i 

Each projection operator corresponds to a different outcome of the proposed mea- 
surement. In a specific application of the formalism, P e — \K ( )(K e \ may be the 
projection operator corresponding to the eigenvalue K ( of an observable (or, more 
generally, a complete commuting set of observables) symbolized by K. If this is the 
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case, we regard the values K e as the possible outcomes of the measurement. The 
probabilities of finding these outcomes are 

Pe = trace(P ( p) (15.110) 

Equation (15.109) guarantees that the probabilities add up to unity. If the state of 
the system happens to be an eigenstate the variance (or uncertainty) of K is 
zero, and K' t can be regarded as the sharp value of this observable, akin to a classical 
observable. Somewhat casually, we call this procedure a “measurement of the set 
of observable operators K.” 

Although the complete orthonormal measurements just described stand out 
prominently, it is possible to generalize the notion of a quantum measurement to 
include nonorthogonal operators. Thus, we assume the existence of a set of r positive 
Hermitian operators, 

Aj = WjPj (15.111) 

which are positive multiples of rank-one projection operators Pj. The projection 
operators in (15.111) are not necessarily mutually orthogonal, nor do they generally 
commute. With their weights, Wj (0 s Wj < 1), they are subject to the completeness, 
or overcompleteness, relation 

r r 

2 Aj = 2 w t p j = 1 (15.112) 

j= 1 7=1 

If the measurement is to be implemented on the system in any arbitrary state, com- 
pleteness requires that the number of terms r in this sum must be at least as large 
as n, the dimensionality of the system’s state vector space. The probability that the 
measurement described by the operators Aj yields the jth outcome is given by the 
formula 

Pj = <' v I r | A y - J '^ r > = trace(A 7 p) (15.113) 

The condition (15.112) ensures that these probabilities add up to unity. 

Exercise 15.22. Using the Schwarz inequality, prove that pj ^ Wj. 

In this brief discussion of the generalized quantum measurement defined by the 
set of r operators Aj, and technically referred to as a probability-operator-valued 
measure (or POM), we only emphasize the significance of nonorthogonal terms in 
(15.1 13). 7 Suppose that the state of the system is an eigenstate ( < I > *) of the element 
A k = w k P k of the POM, corresponding to eigenvalue w k . The probability of the jth 
outcome of the measurement defined by the POM is 

Pj = <$*|A;|$*> = w 7 <$*|/ > / |<3>*> = wj tra ce(PjP k ) (15.114) 

This shows that the probability of the fcth outcome ( j — k) may be less than unity, 
and that of the other outcomes (j + k) may not be zero (as would be expected if the 
states k and j were orthogonal). There is a quantum mechanical fuzziness inherent 
in nonorthogonal measurements. In these, unlike complete orthogonal measure- 


7 See Peres (1995) for more detail and many references to books and articles on quantum mea- 
surement and quantum information. For a clear discussion of entropy in the context of coding theory, 
see Schumacher (1995). 
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ments, p k can generally not be interpreted as the probability of finding the system 
to be in state |<b fc ), and this state cannot be associated unambiguously with just one 
of the possible outcomes of such a measurement. Nonorthogonal quantum measure- 
ments exhibit more peculiarly quantal features than orthogonal measurements 
of observables, whose eigenstates can be unambiguously associated with sharp val- 
ues of the corresponding physical quantities, in a manner reminiscent of classical 
physics. 

As an illustration of a POM for a one-dimensional system with an infinite- 
dimensional Hilbert space, we draw attention to the closure relation (10.111) for the 
coherent states of a harmonic oscillator: 



(15.115) 


which is precisely in the form (15.112), applied to a continuously variable outcome, 
identified by the complex number a. If the system is known to be in the Mi energy 
eigenstate of the harmonic oscillator, the probability density (per unit area in the 
complex a plane of Figure 10.1) for outcome a in a measurement of the nonorthog- 
onal POM defined by the coherent states, A a = (l/7r)|a)(a|, is 

p a {k) = ^-\a\ 2k e-\ a \ 2 (15.116) 


Except for the factor w a = 1/7T, this is the same as (10.110). 

So far, in this section, the density operator has merely served as an alternative 
to describing a quantum state by a vector in Hilbert space. It would be possible to 
formulate all of quantum mechanics exclusively in terms of density operators and 
their matrix elements, but the required mathematical techniques are generally less 
familiar than those pertaining to amplitudes and wave functions. (However, effective 
approximation schemes for complex many-particle quantum systems have been in- 
vented using density operator and density matrix methods. 8 ) 

In Chapter 16, we will illustrate the use of the density operator and its repre- 
sentative, the density matrix, for the simple case of a spin one-half system whose 
spatial coordinate degrees of freedom are irrelevant and can be suppressed. We will 
find that the full benefit of using the density matrix accrues only when it is applied 
to a statistical ensemble of imaginary replicas of the system in the tradition of Gibbs, 
thereby creating a mixture of different quantum states. A mixture can be visualized 
as the set of probabilities, or relative frequencies, p t , with which N different quantum 
states |*P t ) or the corresponding density operators p, = | [ occur in the ensem- 

ble denoted by %. We must require that 

N 

?,Pi= 1 (15.117) 

i=i 

but the states I'P,) generally need not be orthonormal. Equations (15.105), (15.107), 
and (15.108) show that probabilities and expectation values for quantum states de- 
pend on the density operator linearly, whereas they depend on the state | quad- 


8 See, for example, Parr and Yang (1989). 
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ratically. It follows that all statistical predictions for the ensemble can be expressed 
in terms of the generalized density operator of the system, 


N 

p = X Pipi 

1=1 


N 


2 p.l't'.X'p. 


(15.118) 


by the universal formula for the average value of an operator A 


N N 

(A) = 2 pWi\A\Vi) = 2 Pi trac e(Ap ; ) = trace(Ap) 

j— 1 i= 1 


(15.119) 


The density operator p, like its constituents, p u is a positive Hermitian operator. If 
all probabilities p, except one vanish, the density operator (15.118) reduces to the 
idempotent operator (15.103). It is then said to describe a pure state. Otherwise it 
represents a mixed state. Since in applications one usually employs a particular basis 
to represent states and operators, the same symbol p is often also used to denote the 
corresponding density matrix. 

All density operator relations that we derived for pure states at the beginning 
of this section carry over to the generalized density operator for a mixture, ex- 
cept for (15.104), which is quadratic in p and characterizes a pure state or one- 
dimensional (rank one) projection operator. Instead, owing to the positive definite- 
ness of the density operator, we have in general, 


0 < trace p 2 < (trace p) 2 = 1 and p n pjj 



(15.120) 


We have constructed the density operator p for a mixture from the assumed 
a priori knowledge of the N pure states p, representing the ensemble % and the 
corresponding probabilities p ; . It is not possible to reverse this procedure and to 
infer the composition of a mixture uniquely. A given density operator p is compatible 
with many (generally, infinitely many) different ways of mixing pure states. We will 
presently quantify the information loss that is incurred in the mixing process. How- 
ever, an exceptionally useful decomposition is always provided by a complete set 
of orthonormal eigenstates \pj) of the Hermitian density operator p and its eigen- 
values pj 


P = 2 Pj\Pj)(Pj\ (15.121) 

j= i 

where n is the dimensionality of the Hilbert space, and 

n 

2 \PM\ = 1 (15-122) 

j= i 

Some of the eigenvalues pj may be zero, and there is a certain amount of arbitrariness 
in the choice of eigenvectors, if eigenvalues are repeated. In particular, if all eigen- 
values of p are equal to 1 In, the density operator is proportional to the identity, 
p = (1 ln)l, and the mixture is as random as possible. Borrowing a term from the 
physics of spatially orientable systems, a completely mixed state for which 
p = (1 !n)l is said to be unpolarized. Any POM composed of positive operators 






5 The Density Operator, the Density Matrix, Measurement, and Information 


367 


A k — w k P k , which resolves the identity according to (15.112), can be employed to 
represent an unpolarized ensemble by writing 


P = 


1 V n 

- 2 , w jPj 


n ,= i 


(15.123) 


Exercise 15.23. Prove the inequalities (15.120). Hint: Trace inequalities are 
most easily proved by using the eigenstates of the density operator as a basis. For 
the second inequality, maximize the probability of finding the system in a super- 
position state 

hp> = c,.j^> + Cj \%) 


Exercise 15.24. If the state of a quantum system is given by a density operator 

p = /> 1 |'P 1 X'P 1 | + p 2 1 ¥ 2 ><^2 1 (15.124) 

where |'Pi, 2 ) are two nonorthogonal normalized state vectors, show that the eigen- 
values of the density operator are 

1 ± Vi - 4pip 2 (i ~ K^il^}! 2 ) 


P 1,2 = 


(15.125) 


If a mixed state with density operator p is defined by a given probability dis- 
tribution of N known pure states p u p 2 , ■ ■ . p N with probabilities p u p 2 , . . . p N , our 
incomplete knowledge of the state can be quantified in terms of the information and 
entropy concepts that are introduced in Section 2 of the Appendix. The Shannon 
mixing entropy (A. 43) for this ensemble %, denoted by H(%), is 

N 

H{%) = -E Pi In Pi (15.126) 

1=1 

We have chosen to express the entropy in terms of the natural logarithm, so that 
the nat is the unit of H{%). 

The quantity H{%) is a measure of our ignorance of the state. A large mixing 
entropy H(%) implies a highly randomized ensemble. If the state of the system is 
pure ( p 2 = p), the information is maximal and the mixing entropy is H(%) — 0. The 
information about the state is complete. If, on the other hand, all p, are equally 
probable. 

Pi = p 2 = ...= p N = ~ (15.127) 

and the mixing entropy is H(%) = In N nats. 

In quantum information theory one investigates how, given a set of a priori 
probabilities about a quantum state, our ignorance and the entropy can be reduced, 
or information gained, by performing measurements on an ensemble %. The decom- 
position (15.121) of a given density operator in terms of its complete set of ortho- 
normal pure eigenstates occupies a special place among the probability distributions 
compatible with p. Its mixing entropy is denoted by S(p) and defined as 


S(p) = -2 Pi ln Pt = (Pi\p ln p\pi) = ~trace(p In p) 

i= 1 i— 1 


(15.128) 
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Here, the function In p of the density operator is understood to be defined as in Eq. 
(10.30). Among all the different entropies that can be usefully defined, S(p ) is sin- 
gled out and referred to as the von Neumann entropy. It can be shown to be the 
smallest of all mixing entropies (15.126) for a given density operator: 

N n 

H{%) = -2 Pi In p ( > - 2 Pj In Pj = S(p) (15.129) 

i=l 7=1 

Thus, the ensemble composed of the orthonormal eigenstates of the density operator, 
the eigen-ensemble, is the least random of all the possible decompositions of p. In 
Chapter 16, this extremal property of the von Neumann entropy will be further 
demonstrated by several examples. 

Exercise 15.25. If an ensemble % consists of an equal-probability mixture 
of two nonorthogonal (but normalized) states I'TS) and | , 'I r 2 ) with overlap C = 
(^j |^ 2 ), evaluate the Shannon mixing entropy H(%) and the von Neumann entropy, 
S{p). Compare the latter with the former as | C| varies between 0 and 1 . What happens 
as C — » 0? 

Exercise 15.26. A given ensemble % consists of a mixture of two equiprobable 
orthonormal states |M f 1 ) and ( ^ 2 ) and a third normalized state l^), which is itself 
a superposition (not a mixture!) |M/ 3 ) = cj'p!) + c 2 |\I'' 2 ), so that the density operator 
is 

p = pj^X^il + P I ^ 2 X^ 2 1 + (1 - 2p) I ^ 3 X ^3 1 (0 ^ P S 1/2) (15.130) 

Work out the eigenprobabilities of p and the Shannon and von Neumann entropies. 
Discuss their dependence on the mixing probability p and on the amplitudes c 1>2 . 

Entropy can be defined for any probability distribution. To gauge the predict- 
ability of the outcome of a measurement of an observable AT on a system with density 
operator p, we define the outcome entropy : 

n 

H(K ) = -2 p(Kj) In p(Kj) (15.131) 

7=1 

Since the probabilities 

p(Kj) = trace( \Kj){Kj \ p) (15.132) 

can be calculated directly from the density operator, the value of the outcome en- 
tropy, H(K ), is independent of the particular ensemble % which represents p. Again, 
the von Neumann entropy stands out, because one can prove that 

n n 

H(K ) = -2 p(Kj) In p(Kj) > - 2 Pi In P,- = S(p) (15.133) 

7=1 i = 1 

For the special case of a pure quantum state | ^), or p = | '^)('^‘ | and S(p) = 0, the 
relation (15.133) makes the trite but true statement that there is generally an inevi- 
table loss of information, if we know only the probabilities | (Kj |''P‘)| 2 for measuring 
the observable K. We are missing the valuable information stored in the relative 
phases of the amplitudes. 

The fundamental significance of the von Neumann entropy S(p) should now be 
apparent. Thermodynamic considerations show that, multiplied by the Boltzmann 
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constant, k, the von Neumann entropy is also the form of the entropy whose maxi- 
mum, subject to certain constraints, yields, according to the second law of thermo- 
dynamics, the equilibrium distributions for quantum statistical mechanics. We will 
implement this principle in Section 22.5, after an introduction to the quantum phys- 
ics of identical particles. 

To complete the discussion of the density operator, we must give an account of 
its time evolution. If the density operator (15.118) for the system, with a Hermitian 
Hamiltonian H, is* given at some initial time t 0 , each constituent pure-state density 
operator p ; develops according to the equation of motion (14.20). Owing to the linear 
dependence of p on the components p,, and the linearity and homogeneity of (14.20), 
the density operator p develops in the Schrodinger picture according to the dynam- 
ical equation 



[H,p] 


(15.134) 


If we assume that 


ih 


dp n ~ l 

dt 


[ H , p n ~ l ] 


for any positive integer n, it follows by induction that 

ih ^ = ih ^ p"- 1 + ihp = [H, p]p n ~ l + p[H, p"- 1 ] = [H, p n ) 
dt dt dt 


Hence, (15.134) can be generalized for any analytic function /(p) of the density 
operator: 


ih 


df(p) 

dt 


= [ H , /(p)] 


(15.135) 


The equation of motion (14.18) for the expectation value of an operator A, which 
may be time-dependent, can be equally well applied to a mixed state: 


ih 


d{A) 

dt 


= {[A, H}) + ih 



= trace(p[A, H]) + ih trace 


P 



(15.136) 


By substituting A = f(p) in (15.136), and using (15.135), if follows that the expec- 
tation value of any function of the density operator is constant in time: 




(15.137) 


Exercise 15.27. Give a direct proof that for a general mixed state, 
{/(p)) = trace(p/) is constant in time, by noting that the density operator evolves 
in time by a unitary transformation, p(t) = Tp(t 0 )T f . 

Exercise 15.28. Prove that 


trace(p[/(p), H]) = trace(/(p)[/7, p]) = 0 


(15.138) 
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and that consequently, 



(15.139) 


As an important corollary, it follows that the von Neumann entropy, which is 
the mean of the density operator function In p, remains constant as the system 
evolves in time: 


dS(p) d 

— - — = — — trace(p In p) = 0 (15.140) 

dt dt 

This exact conclusion is not inconsistent with the familiar property of entropy in 
statistical thermodynamics as a quantity that increases in time during the irreversible 
approach to equilibrium, because it holds only under the precise conditions that we 
have specified, including the idealization that the probability distribution of the sta- 
tistical mixture representing the ensemble is fixed in time. 

The sketchy introduction to the concepts of the quantum theory of measurement 
and information presented in this section will be supplemented by concrete examples 
in the next chapter in the context of quantum mechanics in a vector space of only 
two dimensions. As we apply the results obtained in this section, we should remem- 
ber that common terms like “measurement” and “information” are being used here 
with a specific technical meaning. In particular, this is not the place for a detailed 
analysis of real experimental measurements and their relation to the theoretical 
framework. We merely note that, in the information theoretic view of quantum me- 
chanics, the probabilities and the related density operators and entropies, which are 
employed to assess the properties of quantum states and the outcomes of measure- 
ment, provide a coherent and consistent basis for understanding and interpreting the 
theory. 


Problems 

1. For a system that is characterized by the coordinate r and the conjugate momentum 
p, show that the expectation value of an operator F can be expressed in terms of the 
Wigner distribution IT(r', p') as 

(F) = <¥|F|¥> = 1 1 Fw(r\ p')W(r\ p') dV d 3 p' 


where 


F w ( r\ p') = j e m - r "(r’ - y|F|r' + y> dV 

and where the function JT(r', p ') is defined in Problem 5 in Chapter 3. Show 9 that 
for the special cases F = /( r) and F = g(p) these formulas reduce to those obtained 
in Problems 5 and 6 in Chapter 3, that is, F w (r') = /( r') and F w (p') = g(p'). 


9 Recall that in expressions involving the Wigner distribution r and p stand for operators, and the 
primed variables are real-number variables. 
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2. Show that the probability current density at r 0 is obtained with 


jw(r 0 ; r\ p') 


2m 


S( r' - r 0 ) 


so that the current density at r 0 is 


j(r 0 ) = J W(r 0 , p') ^ d 3 p' 

3. Derive the Wigner distribution function for an isotropic harmonic oscillator in the 
ground state. 

4. Prove that for a pure state the density operator 1 | is represented in the Wigner 
distribution formalism by 

PwC r\ P') = (27rft) 3 W(r\ p') 

Check that this simple result is in accord with the normalization condition < p ) = 1 
for the density operator. 

5. For a free particle, derive the equation of motion for the Wigner distribution 


6W(r f , p', t) + p; 
dt m 


V T ’W(x', p', t) 


0 


from the time-dependent Schrodinger equation. What does the equation of motion for 
W for a particle in a potential V(r) look like? 

6. Two particles of equal mass are constrained to move on a straight line in a common 
harmonic oscillator potential and are coupled by a force that depends only on the 
distance between the particles. Construct the Schrodinger equation for the system and 
transform it into a separable equation by using relative coordinates and the coor- 
dinates of the center of mass. Show that the same equation is obtained by first 
constructing a separable classical Hamiltonian and subjecting it to canonical quan- 
tization. 

7. Assuming that the two particles of the preceding problem are coupled by an elastic 
force (proportional to the displacement), obtain the eigenvalues and eigenfunctions 
of the Schrodinger equation and show that the eigenfunctions are either symmetric 
or antisymmetric with respect to an interchange of the two particles. 
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The spin (one-half) of a particle or atom or nucleus provides an 
opportunity to study quantum dynamics in a state vector space with only 
two dimensions. All laws and equations can be expressed in terms of two 
components and 2X2 matrices. Moreover, we gain insight into the effect 
of rotations on quantum states. The lessons learned here are transferable 
to the general theory of rotations in Chapter 17. Polarization and 
resonance in static and time-varying fields are characteristic spin features 
described by the theory and observed in experiments. The spin also lends 
itself to an explicit and relatively transparent discussion of the 
interpretation of quantum mechanics and its amplitudes, density matrices, 
and probabilities. In the quantum mechanics of two-dimensional complex 
vector spaces, it is possible to concentrate on the intriguing features of 
the theory, untroubled by mathematical complexities. 

1. Intrinsic Angular Momentum and the Polarization of ip Waves. In Chapter 
15, we were concerned with the quantum description of a particle as a mass point, 
and it was assumed that the state of the particle can be completely specified by 
giving the wave function ip as a function of the spatial coordinates x, y, z, with no 
other degrees, of freedom. The three dynamical variables were postulated to consti- 
tute a complete set. Alternatively and equivalently, the linear momentum compo- 
nents p x , p y , p z also form a complete set of dynamical variables, since <£(p) contains 
just as much information about the state as ip(r). The Fourier integral links the two 
equivalent descriptions and allows us to calculate (p from ip, and vice versa. 

It is important to stress here that completeness of a set of dynamical variables 
is to be understood with reference to a model of the physical situation, but it would 
be presumptuous and quite unsafe to attribute completeness in any other sense to 
the mathematical description of a physical system. For no matter how complete the 
description of a state may seem today, the history of physics teaches us that sooner 
of later new experimental facts will come to light which will require us to improve 
and extend the model to give a more detailed and usually more complete description. 

Thus, the wave mechanical description of the preceding chapters is complete 
with reference to the simple model of a point particle in a given external field, and 
it is remarkable how many fundamental problems of atomic, molecular, and nuclear 
physics can be solved with such a gross picture. Yet this achievement must not blind 
us to the fact that this simple model is incapable of accounting for many of the finer 
details. In particle physics and in many problems in condensed-matter physics, it is 
inadequate even for a first orientation. 

A whole host of quantum properties of matter can be understood on the basis 
of the discovery that many particles, including electrons, protons, neutrons, quarks, 
and neutrinos, are not sufficiently described by the model of a point particle whose 
wave function as a function of position or momentum exhausts its dynamical prop- 
erties. Rather, all the empirical evidence points to the need for attributing an angular 
momentum or spin to these particles in addition to their orbital angular momentum, 
and, associated with this, a magnetic moment. For composite particles like protons 
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and neutrons, these properties can be understood in terms of their internal quark 
structure, but leptons like electrons and muons appear to be elementary point-like 
constituents of matter, yet nevertheless possess intrinsic angular momentum. 

What is the most direct evidence for the spin and the intrinsic magnetic mo- 
ment? Although it was not realized at the time. Stern and Gerlach first measured the 
intrinsic magnetic moment in experiments 1 whose basic features are interesting here 
because they illustrate a number of concepts important in interpreting quantum me- 
chanics. The particles, which may be entire atoms or molecules whose magnetic 
moment pi is to be measured, are sent through a nonuniform magnetic field B. They 
are deflected by a force which according to classical physics is given by 

F = V(pi ■ B) (16.1) ' 

and they precess around the field under the influence of the torque t = pi X B. The 
arrangement is such that in the region through which the beam passes the direction 
of B varies only slowly, but its magnitude B is strongly dependent on position. 
Hence, the projection p B of pi in the direction B remains sensibly unchanged, and 
we have approximately 

F « /x s VB (16.2) 

By measuring the deflection, through inspection of the trace that the beam deposits 
on the screen, we can determine this force, hence the component of the magnetic 
moment in the direction of B. Figure 16.1 shows the outline of such an experiment. 

The results of these experiments were striking. Classically, we would have ex- 
pected a single continuous trace, corresponding to values of p B , ranging from —/jl 
to + / x . Instead, observations showed a number of distinct traces, giving clear proof 
of the discrete quantum nature of the magnetic moment. Since the vector pi seemed 
to be capable of assuming only certain directions in space, it became customary to 
speak of space quantization. 

Stern and Gerlach also obtained quantitative results. They found that the values 
of p, B appeared to range in equal steps from a minimum, —/a, to a maximum, /a. The 
value /a of the maximum projection of pi is conventionally regarded as the magnetic 
moment of a particle. 

In order to interpret these results, we recall Ampere’s hypothesis that the mag- 



Figure 16.1. Measurement of the vertical component of the magnetic moment of atoms in 
an inhomogeneous magnetic field (Stern-Gerlach experiment). Silver atoms incident from 
the left produce two distinct traces corresponding to “spin up” and “spin down.” 


^ee Cagnac and Pebay-Peyroula (1971), p. 239. 
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netic properties of matter are attributable to electric currents of one form or another. 
Thus, the circulating currents due to electrons (of charge —e and mass m e ) in atoms 
produce an orbital angular momentum L and a magnetic moment pt connected by 
the classical relation, 



(16.3) 


which, being a simple proportionality of two vectors, is expected to survive in quan- 
tum mechanics also. Since any component of L has 2€ + 1 eigenvalues, we expect 
the projection of |x in a fixed direction, such as B, also to possess 2€ + 1 distinct 
eigenvalues and to be expressible as 


Mb = 


eft 
2 m e c 


m = —f3 0 m 


(16.4) 


where the magnetic quantum number m can assume the values — €, — € + 1, € — 1, 
€. The Bohr magneton j8 0 is defined as 


j6 0 = 


eh 

2 m e c 


(16.5) 


and has 2 * the value 9.27401 X 10 24 J/T = 9.27401 X 10 21 erg/gauss = 5.78838 X 
10" 5 eV/T. 

Since € is an integer, we expect an odd number (2€ + 1) of traces in the Stern- 
Gerlach experiment. It is well known that the classical experiment with a beam of 
silver atoms, passing through an inhomogeneous magnetic field, yielded instead two 
traces, i.e., an even number, corresponding to 


Mb = 


eh 

2 m e c 


— /3 0 


We may ask if the semiclassical arguments used above are valid when we con- 
tend with quantum phenomena. Equation (16.2) is purely classical, and we may 
wonder if its application to quantized magnetic moments has not led us astray. The 
answer to these questions is that like most experiments the Stern-Gerlach experiment 
has components that are properly and correctly described by the laws of classical 
physics. For these are the laws that govern the experiences of our senses by which 
we ultimately, if indirectly, make contact with what happens inside atoms and nuclei. 
If the particles that the inhomogeneous field in a Stern-Gerlach experiment deflects 
are sufficiently massive, their motion can be described by wave packets that spread 
very slowly; hence, this motion can be approximated by a classical description. 

The correct interpretation was given to the Stern-Gerlach observations only 
after Goudsmit and Uhlenbeck were led by a wealth of spectroscopic evidence to 
hypothesize the existence of an electron spin and intrinsic magnetic moment. If one 
assumes that the electron is in an 5 state in the Ag atom, there can be no contribution 
to the magnetic moment from the orbital motion, and /x = eh!2m e c measures the 
maximum value of a component of the intrinsic magnetic moment. Unlike a mag- 
netic moment arising from charged particles moving in spatial orbits, this magnetic 
moment may be assumed to have only two projections, p, B = ±/3 0 . According to the 


2 Cohen and Taylor (1996). This useful compilation of fundamental constants is updated and ap- 

pears annually in the August issue of Physics Today. 
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Goudsmit-Uhlenbeck hypothesis, we envisage the electron to be a point charge with 
a finite magnetic dipole moment, the projection of which can take on only two 
discrete values. It is now known that the electron magnetic moment differs very 
slightly from the Bohr magneton and has the value 1.001 159 652 193 j8 0 , owing to 
a small quantum electrodynamic correction. The muon magnetic moment similarly 
differs by a minute amount from its naively expected value m e /3 0 /m muon . 

Goudsmit and Uhlenbeck also postulated that the electron has an intrinsic an- 
gular momentum (spin), but this quantity is not nearly as easy to measure directly 
as the magnetic moment. Without appealing to the original justification for the elec- 
tron spin, which was based on experience with atomic spectra, we can marshal a 
fundamental argument for the assumption that an electron must have intrinsic an- 
gular momentum: From experiment we know that an electron, whether free or bound 
in an atom, does have a magnetic moment. Unless the atomic electron, moving in 
the electric field of the nucleus, possesses intrinsic angular momentum, conservation 
of angular momentum cannot be maintained for an isolated system such as an atom. 

To elaborate on this point, we note that, just as a moving charge is subject to 
a force in a magnetic field, so a moving magnetic moment, such as the intrinsic 
electron moment is envisaged to be, is also acted on by forces in an electric field. 
The potential energy associated with these forces is 

v „ 

|x • - X E 
c 

which, for a central field [E = /(r)r], is proportional to (jl • v X r, or to 

|x • r X v = (x • L (16.6) 

The factor of proportionality depends only on the radial coordinate r. If the Ham- 
iltonian operator contains, in addition to the central potential, an interaction term 
like (16.6) proportional to pi • L, the energy of the electron depends on the relative 
orientation of the magnetic moment and the orbital angular momentum. It is apparent 
that L, whose components do not commute, can then no longer be a constant of the 
motion. Conservation of angular momentum can be restored only if the electron can 
participate in the transfer of angular momentum by virtue of an intrinsic spin as- 
sociated with the intrinsic magnetic moment pi. 

We conclude that the magnetic moment of a system must always be associated 
with an angular momentum (see Section 16.4). For leptons with no internal structure, 
the relativistic Dirac theory of the electron in Chapter 24 will provide us with a 
deeper understanding of these properties. However, at a comparatively unsophisti- 
cated level in describing interactions that are too weak to disturb the internal struc- 
ture of the particles appreciably, we may treat mass, charge, intrinsic angular mo- 
mentum, and magnetic moment as given fixed properties. 

As the presence of h in the formula p — eh/2mc shows, the intrinsic spin and 
the corresponding magnetic moment are quantum effects signaling an orientation in 
space, and we must now find an appropriate way of including this in the theory. 
Wave mechanics was developed in Chapter 2 with relative ease on the basis of the 
correspondence between the momentum of a particle and its wavelength. This sug- 
gests that, in our effort to construct a theory that includes the spin, we should be 
aided by first determining what wave feature corresponds to this physical property. 

A scattering experiment can be designed to bring out the directional properties 
of waves. If a homogeneous beam of particles described by a scalar wave function 
i f/(x, y, z, t), such as alpha particles or pions, is incident on a scatterer, and if the 
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target is composed of spherically symmetric or randomly oriented scattering centers 
(atoms or nuclei), as discussed in detail in Chapter 13, we expect the scattered 
intensity to depend on the scattering angle 6 but not on the azimuthal angle <p that 
defines the orientation of the scattering plane with respect to some fixed reference 
plane. In actual fact, if the beam in such experiments with electrons, protons, neu- 
trons, or muons is suitably prepared, a marked azimuthal asymmetry is observed, 
including a right-left asymmetry between particles scattered at the same angle d but 
on opposite sides of the target. It is empirically found that the scattered intensity 
can be represented by the simple formula 


7 = a(d ) + b(6) cos <p 


(16.7) 


provided that a suitable direction is chosen as the origin of the angle measure <p. 
The simplest explanation of this observation is that i/r representing an electron is not 
a scalar field, and that if/ waves can be polarized. (Here, “electron” is used as a 
generic term. Polarization experiments are frequently conducted with protons, neu- 
trons, atoms, nuclei, and other particles.) 

Figure 16.2 shows the essential features of one particular polarization experi- 
ment. A beam 7 0 of unpolarized electrons is incident on an unpolarized scatterer A. 
The particles, scattered at an angle from the direction of incidence, are scattered 
again through the angle d 2 by a second unpolarized scatterer B, and the intensity of 
the so-called second scattered particles is measured as a function of the azimuthal 
angle <p, which is the angle between the first and second planes of scattering. Owing 
to the axial symmetry with respect to the z axis, the intensities f and I[ are equal, 
but I R # I L , and the azimuthal dependence of the second scattered particle beam can 
be fitted by an expression of the form (16.7). 

It is instructive to compare these conclusions with the results of the analogous 
double scattering experiment for initially unpolarized X rays. With the same basic 
arrangement as in Figure 16.2, no right-left asymmetry of X rays is observed, but 
the polarization manifests itself in a cos 2 (p dependence of the second scattered 
intensity. Since intensities are calculated as squares of amplitudes, such a behavior 
suggests that electromagnetic waves may be represented by a vector field that is 
transverse and whose projection on the scattering plane, when squared, determines 
the intensity. 

The presence of a cos <p, instead of a cos 2 <p, term precludes a similar conclusion 
for the electron waves and shows that, if their polarization can be represented by a 
vector, the intensity must depend on this vector linearly and not quadratically. 
Hence, the wave function, whose square is related to the intensity, is not itself a 
vectorial quantity, and the polarization vector (P) will have to be calculated from it 
indirectly. 

In summary, the polarization experiments suggest that the wave must be rep- 
resented by a wave function, which under spatial rotations transforms neither as a 
scalar nor as a vector, but in a more complicated way. On the other hand, the inter- 
pretation of the Stern-Gerlach experiment requires that, in addition to x, y, z, the 
wave function must depend on at least one other dynamical variable to permit the 
description of a magnetic moment and intrinsic angular momentum which the elec- 
tron possesses. Since both the polarization of the waves and the lining up of the 
particle spins are aspects of a spatial orientation of the electron, whether it be wave 
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Figure 16.2. Geometry of a double scattering experiment. The first plane of scattering at 
A is formed by I 0 , /, and I[, in the plane of the figure. The first scattering polarizes the 
beam, and the second scattering at B and B' analyzes the degree of polarization. The 
second plane of scattering, formed by I u I R , and I L , need not coincide with the first plane 
of scattering. The angle between the two planes is <p, but is not shown. 


or particle, it is not far-fetched to suppose that the same extension of the formalism 
of wave mechanics may account for both observations. 

Similarly, we will see in Chapter 23 that the vector properties of electromag- 
netic waves are closely related to the intrinsic angular momentum (spin 1) of 
photons. 

2. The Quantum Mechanical Description of the Spin. Although the formalism 
of quantum mechanics, which we developed in Chapters 9, 10, and 14, is of great 
generality, we have so far implemented it only for the nonrelativistic theory of single 
particles that have zero spin or whose spin is irrelevant under the given physical 
circumstances. To complement the set of continuously variable fundamental observ- 
ables x, y, z for an electron, we now add a fourth discrete observable that is assumed 
to be independent of all the coordinate (and momentum) operators and commutes 
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with them. We denote its (eigen-)values by cr. This spin variable, which is capable 
of taking on only two distinct values, is given a physical meaning by associating 
the two possible projections of the magnetic moment jjl, as measured in the Stern- 
Gerlach experiment, with two arbitrarily chosen distinct values of cr. 

cr = +1 with p B 
<j = — 1 with p B 

Often cr — +1 is referred to as “spin up” and cr = —1 as “spin down” (see Figure 
16.1). We assume that the basic rules of quantum mechanics apply to the new in- 
dependent variable in the same way as to the old ones. 

In the coordinate representation, the probability amplitude or wave function for 
an electron now depends on the discrete variable cr in addition to x, y, z, and may 
be written as ^(r, t) — (r, cr, t|T f ). This can be regarded as a two-component object 
composed of the two complex-valued amplitudes, ij/+( r, t ) = (r, +1, f | M r ) for “spin 
up” and i//_(r, t ) — (r, — 1, 1 1^) for “spin down.” Suppressing the time dependence 
of the wave function, | ift±(x, y, z)\ 2 dx dy dz is thus assumed to measure the prob- 
ability of finding the particle near x, y, z, and of revealing the value p B = +/3 0 , 
respectively, for the projection of the magnetic moment in the direction of the 
field B. 

There is no a priori reason to expect that such a modest generalization of the 
theory will be adequate, but the appearance of merely two traces in the Stern-Gerlach 
experiment, and, as we will see later, the splitting of the spectral lines of one-electron 
atoms into narrow doublets, make it reasonable to assume that a variable which can 
take on only two different values — sometimes called a dichotomic variable — may 
be a sufficiently inclusive addition to the theory. 

The mathematical apparatus of Chapters 9 and 10 can be applied to extend the 
formalism of wave mechanics without spin to wave mechanics with spin. Since space 
and spin coordinates are assumed to be independent of each other, it is natural to 
use a two-dimensional matrix representation for the specification of the state: 

(16.9) 

where the matrix i/s, with one column and two rows, now stands for a two-component 
spin wave function. Wherever we previously had an integration over the continu- 
ously infinitely many values of the position variables, we must now introduce an 
additional summation over the pairs of values which cr assumes, such as in the 
normalization integral: 

i p~(x, y, z) | 2 dx dy dz = 1 (16.10) 

It is instructive to study the behavior of the spin variable separately from the 
space coordinates and to consider a system whose state is described by ignoring the 
x, y, z coordinates and determined, at least to good approximation, entirely by two 
spin amplitudes. We designate such a general spin state as x and write it as 


c 

1 i (t+(x, y, z) | 2 dx dy dz + 

•r V 




X = 


Cl 

C 2 


(16.11) 
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The complex-valued matrix elements c x and c 2 are the amplitudes for “spin up” 
and “spin down,” respectively. The column matrix (16.11), often referred to as a 
spinor, represents a state vector in an abstract two-dimensional complex vector 
space. Such states are more than mathematical idealizations. In many physical sit- 
uations, the bodily motion of a particle can be ignored or treated classically, and 
only its spin degree of freedom need be considered quantum mechanically. The study 
of nuclear magnetism is an example, since we can discuss many experiments by 
assuming that the ‘nuclei are at fixed positions and only their spins are subject to 
change owing to the interaction with a magnetic field. 

Study of the spin formalism in isolation from all other degrees of freedom serves 
as a paradigm for the behavior of any quantum system whose states can be described 
as linear superpositions of only two independent states. There are innumerable prob- 
lems in quantum mechanics where such a two-state formalism is applicable to good 
approximation, but that have nothing to do with spin angular momentum. The anal- 
ysis of reflection and transmission from a one-dimensional potential in Chapters 6 
and 7 has already illustrated the convenience of the two-dimensional matrix for- 
malism. Other examples are the coupling of the 2 S and 2 P states of the hydrogen 
atom through the Stark effect (Chapter 18), the magnetic quenching of the triplet 
state of positronium (Problem 4 in Chapter 17), the isospin description of a nucleon, 
the transverse polarization states of a photon (Chapter 23), and the life and death of 
a neutral kaon (Problem 1 in Chapter 16). 

The basis states of the representation defined by the assignments (16.8) are 

« = (q) and /3 = ^ (16.12) 


Thus, a represents a state with spin “up,” and /3 represents a state with spin 
“down.” In the general state, 



Cia + c 2 j 3 


(16.13) 


| Ci | 2 is the probability of finding the particle with spin up, and | c 2 1 2 is the probability 
of finding it with spin down. Hence, we must require the normalization 


ic,p + ic 2 p = (cT cnr) = i 

This can be written as 

X f X = 1 

if we remember that 


(16.14) 


(16.15) 


X f = (cf ct) 


(16.16) 


Given two spinors, x and X ' » the (Hermitian) inner (or complex scalar) product is 
defined as 


*v = (4 = etc : + ctd (i6.i7) 

Two spinors are orthogonal if this product is zero. The two spinors a and /3 defined 
in (16.12) are orthogonal and normalized, as a + a = j 6 + /3 = 1. Such pairs of ortho- 
normal spinors span the basis of a representation. 
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All definitions and manipulations introduced in Chapters 4, 9, and 10 for com- 
plex linear vector (Hilbert) spaces of n dimensions can in this simple case, where 
n = 2, be written out explicitly in terms of two-dimensional matrices. If we commit 
ourselves to a specific fixed representation, all equations and theorems for state 
vectors and linear operators can be interpreted directly as matrix equations. As long 
as confusion is unlikely to occur, the same symbol can be used for a state and the 
spinor that represents it; similarly, the same letter may be used for a physical quan- 
tity and the matrix (operator) that represents it. In many ways, the spin formalism 
is much simpler than wave mechanics with its infinite-dimensional representations. 
Since the state vector space is two-dimensional, the mathematical complexity of the 
theory is significantly reduced. 

For example, if A is a linear operator (perhaps representing a physical quantity), 
it appears as 



and its action on the spinor x, which produces the new spinor £ = A\, is represented 
as 



(16.19) 


where the components of f are denoted by d\ and d 2 . The (Hermitian) adjoint A" of 
A is 



and the expectation value of A in the state x is 


(A) = (cf 



(16.20) 


(16.21) 


* 

Exercise 16.1. In the spin matrix formalism, show that if and only if the 
expectation value of a physical quantity A is real-valued, the matrix A is Hermitian. 
Prove, by direct calculation, that the eigenvalues of any Hermitian 2X2 matrix are 
real and its eigenspinors orthogonal if the two eigenvalues are different. What hap- 
pens if they are the same? 


An arbitrary state can be expanded in terms of the orthonormal eigenspinors, u 
and v, of any Hermitian matrix A: 

X = “(w + X) + v(v'x) (16.22) 

The expansion coefficients 

(tdx) = (u* wf)^ 1 ^ and (u 1 *) = (v* (16.23) 

are the probability amplitudes of finding the eigenvalues of A corresponding to the 
eigenspinors u and v , respectively. 

To endow this purely mathematical framework with physical content, we must 
identify the physical quantities associated with the spin of a particle and link them 
with the corresponding Hermitian matrices. A physical quantity of principal interest 
is the component of the electron’ s intrinsic magnetic moment in the direction of the 
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magnetic field, which motivated the extension of the theory to dichotomic spin vari- 
ables. Since B can be chosen to point in any direction whatever, we first select this 
to be the z axis of the spatial coordinate system. Then the z component of the intrinsic 
magnetic moment of an electron is evidently represented by the Hermitian matrix 


Hz = ~P 



(16.24) 


since the eigenvalues of are to be Tp 0 = +eh/2m e c, and the corresponding states 
may be represented by the basis spinors a and (3. 

How are the other components of |x represented? The magnetic moment (x has 
three spatial components, /jl x , p y , p z , and by choosing a different direction for B we 
can measure any projection, p B , of |x. If our two-dimensional formalism is adequate 
to describe the physical situation, any such projection p B must be represented by a 
Hermitian matrix with the eigenvalues — /3 0 and +/3 0 . In order to determine the 
matrices p x and p y , we stipulate that the three components of (|x) must under a 
rotation transform as the components of an ordinary three-vector. Since an expec- 
tation value, such as (p x ) = x 1 HxX-> is calculated from matrices and spinors, we 
cannot say how the components of (|x) transform unless we establish the transfor- 
mation properties of a spinor x under rotation. We will now turn to this task. 


3. Spin and Rotations. Rotations of systems described by wave functions 
y, z) were already considered in Chapter 1 1 ; here we extend the theory to spin 
states. We first consider a right-handed rotation of the physical system about the 
z axis, keeping the coordinate axes fixed. This is an active rotation, to be distin- 
guished from a passive rotation, which leaves the physical system fixed and rotates 
the coordinate system. As long as we deal solely with the mutual relation between 
the physical system under consideration and the coordinate system, the distinction 
between these two kinds of rotations is purely a matter of taste. However, if, as is 
more commonly the case, the physical system that we describe by the quantum 
mechanical state vector is not isolated but is embedded in an environment of external 
fields or interacts with other systems, which we choose to keep fixed as the rotation 
is performed, the active viewpoint is the appropriate one, and we generally prefer it. 

Figure 11.1 (ft) pictures an active rotation by an angle 4> about the z axis, which 
carries an arbitrary spin state x iuto a state x' . The relation between these two 
spinors may be assumed to be linear. (As will be shown in Section 17.1, this as- 
sumption involves no loss of generality, and in any case we will see that a valid 
linear transformation representing any rotation can be found.) Thus, we suppose that 
the two spinors are related by 


X' = UX (16-25) 

where U is a matrix whose elements depend on the three independent parameters of 
the rotation only, e.g., the axis of rotation n and the angle cf> . 

Since the physical content of the theory should be invariant under rotation, we 
expect that normalization of X implies the same normalization of x'- 

x'V = x t u f u x = 1 

Since x is arbitrary, it follows that 


UW = 1 


(16.26) 
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so U must be a unitary matrix. From this matrix equation we infer that 

det U f det U = | det U\ 2 = 1 (16.27) 

Hence, a unitary matrix has a unique inverse, U~ l = I/ t , and 

UU f = 1 (16.28) 

The unitary matrix U, which corresponds to the rotation that takes x into is 
said to represent this rotation. If U\ represents a rotation R x about an axis through 
the origin, and U 2 represents a second rotation R 2 also about an axis through the 
origin, then U 2 U l represents another such rotation R 3 , obtained by performing first 
and then R z . In this way x is first transformed to x' ~ U\X> which subsequently 
is transformed to x" = U 2 x' = U 2 U\X- Alternatively, we could, according to Euler’s 
famous theorem, have obtained the same physical state directly from x by perform- 
ing a single rotation R 3 , represented by U 3 . Hence, the unitary rotation matrices are 
required to have the property 

U 2 U i = e i(p(R ' R ^U 3 (16.29) 

The phase factor has been put in, because all spinors e ,(p x represent the same state. 

Our goal, the construction of U corresponding to a given rotation R, will be 
considerably facilitated if we consider infinitesimal rotations first. A small rotation 
must correspond to a matrix very near the identity matrix, and thus for a small 
rotation we write to a first approximation the first two terms in a Taylor series: 

U = 1 - t en • J (16.30) 

, n 

where n is the axis of rotation, e is the angle of rotation about this axis, and J 
represents three constant matrices J x , J y , J z . Their detailed structure is yet to be 
determined, and they are called the generators of infinitesimal rotations. The factor 
Uh has been introduced so that J will have certain desirable properties. In particular, 
the imaginary coefficient ensures that J must be Hermitian if U is to be unitary, i.e. 

J + = J (16.31) 

If the three matrices J x , J y , and J z were known, U for any finite rotation could be 
constructed from (16.30) by successive application of many infinitesimal rotations, 
i.e., by integration of (16.30). 

This integration is easily accomplished because any rotation can be regarded as 
the product of N = fils successive rotations by a small angle e about a fixed axis 
(constant n): 

u * ~ i 1 ~ ■ 3 ) (1632) 

or in the limit N — » °°, 

U * = „!“(/ “ M * • J ) (> 6 ' 33 > 

as in elementary calculus, even though U R and n • J are matrices. The exponential 
function with a matrix in the exponent is defined by (16.33) or by the usual power 
series expansion. The necessary groundwork for the matrix algebra was laid in Sec- 
tion 3.4, where we may read “matrix” for “operator.” 
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We still have to derive the conditions under which a matrix of the form (16.33) 
is actually the solution to our problem, i.e., represents the rotation R and satisfies 
the basic requirement (16.29). The application of the condition (16.29) will lead to 
severe restrictions on the possible form of the Hermitian matrices J x , J y , J z , which 
so far have not been specified at all. However, it is convenient not to attack this 
problem directly, but instead to discuss first the rotational transformation properties 
of a vector (A), where A x , A y , A z are three matrices (operators) such that the expec- 
tation values ( A x ), s ( A y ), (A z ) transform as the components of a vector. As stated at 
the end of the last section, the components of the magnetic moment (jx) are an 
example of matrices that must satisfy this condition. Generally, as in Section 11.1, 
a set of three matrices A x , A y , A z is called a vector operator A if the expectation 
values of A x , A y , A z transform under rotation like the components of a vector. 

It is of decisive importance to note that J itself is a vector operator. This follows 
from its definition as the generator of the infinitesimal rotation: 

X ' = (l- l -s n • j)* (16.34) 

Multiplying on the left by we obtain 

/A' = ^ en • <J> (16.35) 

where the expectation value (J) is taken with respect to the state x • The inner prod- 
ucts are invariant under a unitary transformation that represents an arbitrary finite 
rotation, applied simultaneously to both x and x' • Hence, the scalar product n • (J) 
is also a rotational invariant. Since n is a vector, (J) must also transform like a 
vector, and thus J is a vector operator. 

The transformation properties of a three- vector (A) — ^A^ under an active 
rotation are characterized by the equation 

(A)' = (A) + n X (n X (A))(l — cos <fi) + n X (A) sin (f> (16.36) 

where (A)' = ^A^' is the expectation value of A after rotation. In standard 3 X 
3 matrix notation, this equation appears as 



f(A x )\ 



= * (Ay) 

(16.37) 

<AzY I 

\(A Z )/ 



where R is the usual real orthogonal rotation matrix (with det R — 1) familiar from 
analytic geometry and corresponding to an active rotation. 

Exercise 16.2. Check the transformation (16.36) by visualizing a three- 
dimensional rotation. Verify it algebraically for a right-handed rotation about the 
z axis and express it in 3 X 3 matrix form. 


Exercise 16.3. Starting with an infinitesimal rotation about the unit vector 
h(n x , n y , n z ), prove that the rotation matrix R can be represented as 


R = exp(— <5 b n • X) 


(16.38) 
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where 

/O 0 0\ /O 0 -1\ / 0 1 0\ 

X x = 0 0 1 ) X y = 0 0 0 X, = -1 0 0 (16.39) 

\0 — 1 0 / \1 0 0 / \ 0 0 0 / 

are three antisymmetric matrices. 3 Work out their commutation relations and com- 
pare them with the commutation relations for the components of angular momentum. 

For an infinitesimal rotation, (16.36) reduces to 

(A)' = (A) + n X (A)e (16.40) 

We now substitute the expression (16.30) on the left-hand side of this equation and 
equate the terms linear in e on the two sides. Since x is an arbitrary state, it follows 
that 

7 n-JA-Ajn-J = nXA (16.41) 

n n 

or 

An J — n JA = [A, n J] = ihh X A (16.42) 

which is exactly the same condition as Eq. (11.19) derived in Section 11.1, except 
for the replacement of the orbital angular momentum operator L by J. This gener- 
alization was already anticipated in Section 11.2, where an algebraic approach to 
the eigenvalue problem of the angular momentum operator was taken. We can make 
use of the results derived there, since J itself is a vector operator and must satisfy 
(16.42): 

[J, n J ] = m X J (16.43) 

or, using the subscripts i,j, k, with values 1, 2, 3 to denote the Cartesian components 

y, z, 

Ui, Jj ] = ihs ijk J k or SijkJiJj = vhJ k (16.44) 

The Levi-Civita symbol e ijk was defined in Section 11.5. Taking care to maintain 
the order of noncommuting operators, we may combine these commutation relations 
symbolically in the equation 

J X J = ihj (16.45) 

Exercise 16.4. Employing the techniques developed in Section 3.4, verify that 
the commutation relations for A and J assure the validity of the condition (16.36) 
or, explicitly, 

expf— n • J4>) jA expf n • J0j = 

n(n • A)— n X (n X A) cos 0 + n X A sin </> 

(16.46) 

for finite rotations. 

3 Biedenharn and Louck (1981). See also Mathews and Walker (1964), p. 430. 
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Since the trace of a commutator is zero, the commutation relations (16.44) imply 
that the trace of every component of J vanishes. Hence, by (10.29), 

det U R = 1 (16.47) 


so that the matrices U R representing a rotation are unimodular. If we evaluate the 
determinant on both sides of Eq. (16.29), we then conclude that e ,<p(/?1 ’ /?2) = ±1 and 
(16.29) takes the more specific form 


U 2 U X = ±U 3 


(16.48) 


Applying successive finite rotations to a vector operator and using Eq. (16.46), 
it can be shown that the commutation relations for J are not only necessary but also 
sufficient for the unitary operator (16.33) to represent rotations and satisfy the 
requirement (16.48): 4 (For n - 2 a proof will be given in Section 16.4.) 

Although they were prompted by our interest in the two-dimensional intrinsic 
spin of the electron, none of the arguments presented in this section have depended 
on the dimensionality of the matrices involved. The states x and x' connected by 
the unitary matrix U in (16.25) could have n rows, and all results would have been 
essentially the same. In particular, the commutation relations (16.43) or (16.44) 
would then have to be satisfied by three n X n matrices. That a closed matrix algebra 
satisfying these commutation relations can be constructed for every nonnegative 
integer n was already proved in Section 11.2. We will thus be able to use the results 
of this section in Chapter 17, when we deal with angular momentum in more general 
terms. In the remainder of this chapter, however, we confine ourselves to the case 
n = 2, and we must now explicitly determine the Hermitian 2X2 matrices J which 
satisfy the commutation relations. 


4 . The Spin Operators, Pauli Matrices, and Spin Angular Momentum. Follow- 
ing the usual convention, we supposed in Section 16.2 that the z component of the 
vector operator p, the intrinsic magnetic moment, is represented by the diagonal 
matrix (16.24) and that the components c x and c 2 of the spinor x are the probability 
amplitudes for finding p z - -/3 0 (spin up) and +/3 0 (spin down), respectively. A 
rotation about the z axis can have no effect on these probabilities, implying that the 
matrix 



must be diagonal in the representation we have chosen. It follows that J z must itself 
be a diagonal matrix. 

Exercise 16.5. From the commutation relations, prove that if the z component 
of some vector operator is represented by a diagonal matrix, J z must also be diagonal 
(as must be the z component of any vector operator). 


The problem of constructing the matrices J in a representation in which J z is 
diagonal has already been completely solved in Section 11.2. The basis vectors (or 
basis spinors or basis kets or basis states) of this representation are the eigenvectors 
of J z . The commutation relations (11.24) are identical to (16.44). We now see that 


4 Biedenharn and Louck (1981), Section 3.5. 
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for the description of the spin of the electron we must use as a basis the two eigen- 
states of J z and J 2 , which correspond to j = 1/2 and m = ±1/2, From Eqs. (11.42) 
and (11.43) (Exercise 11.11) we obtain the matrices 

J+ = J x + iJ y = ^ and JL = J x - iJ y = ^ (16.49) 

It is customary and useful to define a vector operator (matrix) or proportional to the 
2X2 matrix J : 


T h 
J= 2 a 


Hence, 


0 * 4 - (T x ICTy ' 


0 2 

,0 0 ; 


and cr_ = cr r 


l (T v 


from which we obtain the celebrated Pauli spin matrices. 




Some simple properties of the Pauli matrices are easily derived. 

Exercise 16.6. Prove that the Pauli matrices are unitary and that 


O- 2 = O- 2 = (t\ = 1 


(16.50) 


(16.51) 


(16.52) 


(16.53) 


Exercise 16.7. Prove that 

<t x cr y = i(r z , cr y (T z = icr x , cr z (r x = icr y (16.54) 

and that any two different Pauli matrices anticommute: a x cr y + cr y cr x = 0, and so 
forth. 


Exercise 16.8. Prove that the only matrix which commutes with all three Pauli 
matrices is a multiple of the identity. Also show that no matrix exists which anti- 
commutes with all three Pauli matrices. 


The traces of all Pauli matrices vanish: 

trace a x = trace cr y = trace cr z = 0 (16.55) 

which is a reflection of the general property that the trace of any commutator of two 
matrices vanishes. It follows from the commutation relations (16.42) that the trace 
of any vector operator is zero. In the two-dimensional case ( n = 2), this implies that 
the z-component A z of every vector operator is proportional to J z and consequently 
that all vector operators A are just multiples of J: 

h 

A = kj = k - a (16.56) 

where k is a constant number. The proportionality of A and J, which generally holds 
only for n — 2, is the simplest illustration of the Wigner-Eckart theorem which will 
be derived in Chapter 17. 
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The four matrices 1, a x , a y , cr z are linearly dependent, and any 2X2 matrix 
can be represented as 

A = A 0 1 + A icr x + A 2 (r y + A 3 a z = A 0 1 + k • tr (16.57) 

If A is Hermitian, all coefficients in (16.57) must be real. 


Exercise 16.9. Take advantage of the properties (16.54) and (16.55) of the 
Pauli matrices to “work out the eigenvalues and eigenspinors of A in terms of the 
expansion coefficients A 0 and A.. Specialize to the case A 0 = 0 and A = n, where n 
is a real-valued arbitrary unit vector. 


Exercise 16.10. Show that if U is a unitary 2X2 matrix, it can always be 
expressed as 

U = e ,y (icos to + in ■ cr sin to) (16.58) 

where y and <u are real angles, and n is a real unit vector. 


Exercise 16.11. If A and B are two vectors that commute with cr, prove the 
useful identity 


(tr • A)(ct • B) = A ■ B + i(T ■ (A X B) 


(16.59) 


Applying the identity (16.59) to the power series expansion of an exponential, we 
see that (16.58) is the same as 


U = expO'y + icon • cr) = e ,y (l cos <w + m • tr sin co) 


(16.60) 


which is a generalized de Moivre formula. Any unitary 2X2 matrix can be written 
in this form. In the two-dimensional spin formalism, the rotation matrix (16.33) takes 
the form 


U R = exp 


1 

2 


4 > 


n • a 


(16.61) 


Comparing the last two expressions, we see that every unitary matrix with y — 0 
represents a rotation. The angle of rotation is <j> = ~~2co, and n is the axis of rotation. 
For y — 0 we have det U R = 1, and the matrix U R is unimodular. The set of all 
unitary unimodular 2X2 matrices constitutes the group SU( 2). The connection 
between this group and three-dimensional rotations will be made precise in Chap- 
ter 17. 

We may now write the rotation matrix (16.33) in the form 




<j> cf) 

1 cos in • ct sin — 

2 2 


(16.62) 


One simple but profound consequence of this equation is that for cf) = 2tt we get 
U = —l.A full rotation by 360° about a fixed axis, which is equivalent to zero 
rotation (or the identity), thus changes the sign of every spinor component. The 
double- valuedness of the spin rotation matrices is sanctioned, although not required, 
by the relation (16.48). Vectors (and tensors in general) behave differently: they 
return to their original values upon rotation. However, this sign change of spinors 
under rotation is no obstacle to their usefulness, since all expectation values and 
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matrix elements depend bilinearly on spinors, rendering them immune to the sign 
change. 

Exercise 16.12. Using the special properties of the Pauli matrices, prove di- 
rectly that 

Ur(tU r = n(n • a) - n X (n • n) cos </> 4- n X u sin <p (16.63) 
if U R is given by (16.62) 

Since the right-hand side of (16.63) is the expression for the rotated form of a 
vector, it is evident that if we perform in succession two rotations and R 2 , equiv- 
alent to an overall rotation i? 3 , we can conclude that 

U\U\erU 2 U l = U\tjU 3 


or 

[U 2 U,Ul OP] = 0 

From Exercise 16.8 we thus infer that U 2 U-JJ\ must be a multiple of the identity. 
Since the spin rotation matrices are unimodular (det U — 1), we are led back to 
(16.48), proving that in the case n = 2 the commutation relations are not only 
necessary but also sufficient to ensure the validity of the group property (16.48). 

It may be helpful to comment on the use of the term vector that is current in 
quantum mechanics. A vector V in ordinary space must not be confused with a (state) 
vector such as x in a (complex) vector space. In the context of this chapter, the latter 
is represented by a two-dimensional spinor, but in other situations, such as when 
describing the intrinsic degree of freedom of a spin-one particle, the state vector is 
three-dimensional. To contrast the different behavior of spinors and ordinary vectors 
under rotation, we consider the example of a rotation about the x axis by an angle 
0. From (16.62), (16.52), and (16.25), we obtain for the spinor components: 

, <f> . • <t> ‘ 

Ci = Ci cos — — ic 2 sin — 

2 2 

<f) <p 

c 2 = —ic i sin — l - c 2 cos — 

2 2 

The components of a vector V, on the other hand, transform according to 

V' = V x 

Vy = V y cos 4> ~ V z sin cf> 

V’ z — V y sin <f> + V z cos 4> 

The differences between these two transformations are apparent, but they are con- 
nected. If A is a vector operator, the spinor transformation induces the correct trans- 
formation among the components of the expectation value (A). 

We must now find the matrix representation of the physical observables that 
are associated with an electron or other spin one-half particle. Since, according to 
(16.56), the vector operator tr is essentially unique, we conclude from (16.24) that 
the intrinsic magnetic moment of an electron is given by 

M- = ~Po (r 

thus completing the program of determining the components of |x. 


(16.64) 
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What about the intrinsic angular momentum of the electron, its spinl It was 
shown in Section 16.1 that conservation of angular momentum is destroyed unless 
the electron is endowed with an intrinsic angular momentum, in addition to its orbital 
angular momentum. The interaction energy (16.6) responsible for compromising the 
spherical symmetry of the central forces is proportional to |x • L, which in turn, 
according to (16.64), is proportional to tr • L for a spin one-half particle. We express 
the wave function ifs for the state of the particle in the spinor form (16.9), 


/ if/+(x, y, z)\ 
\>p-(x, y,z) 


(16.65) 


When the interaction tr • L is applied to ijj, the operator L acts only on the functions 
if/+(x, z) of the coordinates, but tr couples the two spinor components. 

A term of the form cr • L in the Hamiltonian is often referred to as the spin- 
orbit interaction. As was explained in Section 16.1, an interaction of this form arises 
in atoms as a magnetic and relativistic correction to the electrostatic potential. It 
produces a fine structure in atomic spectra. In nuclei the spin-orbit interaction has 
its origin in strong interactions and has very conspicuous effects. 

In the presence of a spin-orbit interaction, L is no longer a constant of the 
motion. It is our hope that an intrinsic angular momentum S can be defined in such 
a manner that, when it is added to the orbital angular momentum L, the total angular 
momentum, 


J = L + S 


(16.66) 


will again be a constant of motion. Since S, like L, is a vector operator, it must be 
proportional to c. Indeed, the spin angular momentum S is nothing other than the 
generator (16.50) of rotations for spinors: 


S = 



(16.67) 


since both S and L are just different realizations of the generator J, which was 
introduced in its general form in Section 16.3. The unitary operator that transforms 
the state (16.65) of a particle with spin under an infinitesimal three-dimensional 
rotation must be given by 

U R = 1 — 7 en • S - 7 en • L = 7 — 7 en • J (16.68) 

n n n 

The scalar operator a • L is invariant under this rotation, and (16.66) is the desired 
constant of the motion. 

We can verify this identification by employing the commutation relation (16.42) 
for a vector operator twice. First, we let J = A = L and replace the vector n by S, 
which is legitimate because S commutes with L. This yields the equation 

[L, S • L] = ihS X L 

Next, we let J = A and S and replace n by L: 


[S, L • S] = ihL X S 
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Owing to the commutivity of L and S, we have SXL = — L X S and S • L = L • S; 
hence, it follows that J = L + S commutes with the operator L • S and is indeed 
conserved in the presence of a spin-orbit interaction. No other combination of L and 
S would have satisfied this requirement. 


Exercise 16.13 . Show that no operator of the form L + atr, other than J = 
L + (h/2)cr commutes with the scalar tr • L. 


Evidently, any component of the intrinsic angular momentum S defined by 
(16.67) has the two eigenvalues + hi 2 and —hi 2. The maximum value of a com- 
ponent of S in units of h is 1/2, and we say that the electron has spin 1/2. Further- 
more, we note that 

s 2 = a ■ cr = (o- 2 + a 2 y + of) = 1 (16.69) 

Hence, any spinor is an eigenspinor of S 2 , with eigenvalue 3/i 2 /4, corresponding to 
s = 1/2 if we express S 2 as s(s + l)h 2 . 

Thus, we see that when the spin is taken into account, J = L + S is the generator 
of infinitesimal rotations (multiplied by h), and conservation of angular momentum 
is merely a consequence of the invariance of the Hamiltonian under rotations. This 
broad viewpoint, which places the emphasis on symmetries, is the subject of Chap- 
ter 17. 


5. Quantum Dynamics of a Spin System. The general dynamical theory of Chap- 
ter 14 is directly applicable to any physical system with two linearly independent 
states, such as the spin of a particle in isolation from other degrees of freedom. In 
the Schrodinger picture, the time development of a two-component state or spinor 
\{t) is governed by the equation of motion, 

i 

ih ^7^ = H X {t) (16.70) 

at 

where the Hamiltonian H is in this instance a 2 X 2 matrix characteristic of the 
physical system under consideration. The essential feature of the equation of motion 
is its linearity, which preserves superpositions, but since we want to apply the theory 
to systems that can decay, we will at this stage not assume that H is necessarily 
Hermitian. 

Obviously, if there are no time-dependent external influences acting and the 
system is invariant under translation in time, H must be a constant matrix, indepen- 
dent of t. Under these conditions, Eq. (16.70) can be integrated, giving 

x(t) = exp^-“ tf^x(0) (16.71) 

in terms of the initial state *(0). 

As usual, it is convenient to introduce the eigenvalues of H, which are defined 
as the roots of the characteristic equation 


det (H - XI) = 0 


(16.72) 
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If there are two distinct roots A = E u E 2 with E 1 ¥= E 2 , we have 

H X\ = EiXi and Hx 2 = E 2 x 2 (16.73) 

and an arbitrary two-component spinor may be expanded as 

X = c iXi + C 2 X 2 (16.74) 


If H is not Hermitian, its eigenvalues will generally not be real. 

If f(z ) is a function of a complex variable, the function f(H) of the matrix H 
is a new matrix defined by the relation 

f(H)x = c 1 f(H) X i + c 2 f(H)x 2 = c x f(E y ) Xl + c 2 f(E 2 ) X2 (16.75). 

By substitution into (16.75), the equality 


/(H) = /(£,) 


E 2 1 - H 
E 2 — Ey 


+ f(E 2 ) 


EJ - H 
Ei - E 2 


(16.76) 


is seen to hold. If the characteristic equation has only one distinct root, so that 
E 2 — Ey, the preceding equation degenerates into 

f(H ) = f(Ei)l + f{Ei)(H - EJ) (16.77) 

Exercise 16.14. Prove Eq. (16.76), and derive (16.77) from it in the limit 
E 2 Ey. 


Equation (16.76) may be applied to expand the time development operator 

i 


f(H ) = exp 


-Htj in the form 


exp --/ft = 


Ei 


(E 2 e~ iEl,/h - Eie~ iE 2 ' ,h ) + 


H 


E 2 ~ Ei 


(16.78) 


if Ei 4 1 E 2 . A system whose Hamiltonian has exactly two distinct eigenvalues may 
be called a two-level system. The formula (16.78) answers all questions about its 
time development. 

From (16.70) it follows in the usual way that 


j t [x\t)x(m = ~ l ~ x\t)ifl - H^)x(t) (16.79) 

and if H is constant in time this may be integrated to give 

X\t)x(t) = x f (0)e m)Ht ‘e-° /h)Ht x(0) (16.80) 

If the matrix H is Hermitian, x f X is constant and probability is conserved. This must 
certainly happen if H represents the energy. If H is Hermitian, E x and E 2 are real 
numbers and the corresponding eigenspinors are orthogonal. 

If the Hamiltonian matrix is not Hermitian, the eigenvalues of H are complex 
numbers and can be expressed as 

.Ti r 2 

- l ~ and E 2 = E 02 - i — 


Ei — E 0 i 


(16.81) 
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where the real parts, E 0l , E Q2 , are the energy levels. If the imaginary parts, T , and 
r 2 , are positive, the two eigenstates are decaying states. The general solution of the 
dynamical equation (16.70) is the superposition 

X(t) = Cie -(.mE 0l ‘ e -r lt ,2 Xi + C2e -m>E 02 t e -r 2t i2 X2 (i 6 . 82 ) 

Unless the two decay rates are equal, the state does not generally follow a pure 
exponential decay law. 

As an application, consider the example of the probability amplitude for a tran- 
sition from an initial state a (“spin up”) to a state p (“spin down”). One gets 
immediately 

(p\T(t, 0)|a> = p f e- (i/h)Ht a = (0 \) e - (mHt \ M = (<T® 2 ' /ft - e~ iE ' ,,h ) 

\0/ E 2 E i 

(16.83) 

The probability obtained from this expression exhibits an interference term. 

As was mentioned in Section 16.3, the dynamics of two-state systems, with or 
without decay, is applicable in many different areas of physics, and the spin for- 
malism can be adapted to all such systems. Often a two-state system is prepared or 
created in a state other than an eigenstate of the Hamiltonian H, and its time devel- 
opment is subsequently observed, displaying intriguing oscillations in time, due to 
interference between the eigenstates of H. Qualitatively similar effects occur in other 
few-state systems, but the analysis is more complicated. We confine ourselves to the 
case of two-state systems and use the ordinary electron or nuclear spin 1/2 in a 
magnetic field as the standard example of the theory. 


Exercise 16.15. In many applications, conservation laws and selection rules 
cause a decaying two-level system to be prepared in an eigenstate of a z , say 


a 



and governed by the simple normal Hamiltonian matrix 


H = al + bcr x 


(16.84) 


where a and b are generally complex constants. In terms of the energy difference 
A E — Eq 2 ~ Eqx and the decay rates I^ and T 2 , calculate the probabilities of finding 
the system at time t in state a or state p, respectively. 


Exercise 16.16. If the Hermitian matrix T = i(H — /f t ) is positive definite, 
show that r\ and Y 2 defined by (16.81) are positive. Conversely, if T 1>2 > 0 and if 
the two decaying eigenstates, X\ and X 2 > of H are orthogonal (implying that H is a 
normal matrix), show that the time rate of change of the total probability x^iOxiO 
is negative for all states x at all times. Verify this conclusion using the results of 
Exercise 16.15 as an example. 


6. Density Matrix and Spin Polarization. In discussing two-level systems, we 
have so far characterized the states in terms of two-component spinors. In this sec- 
tion, we consider some other methods of specifying a state. The spinor (y 1; y 2 real) 



(16.85) 


characterizes a particular state. However, the same physical state can be described 
by different spinors, since x depends on four real parameters, but measurements can 
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give us only two parameters: the relative probabilities | ci | 2 :|c 2 | 2 and the relative 
phase, y 1 — y 2 , of c x and c 2 . If x is normalized to unity 

M 2 + |c 2 | 2 = 1 (16.86) 

the only remaining redundancy is the common phase factor of the components of x, 
and this is acknowledged by postulating that x and e ia x (<*•' arbitrary, real) represent 
the same state. 

An elegant and useful way of representing the state without the phase arbitrar- 
iness is to characterize it by the density operator defined in (1.4.19) as p — |M r )(Mi r |, 
and discussed in detail in Section 15.5. In the context of two-state quantum me- 
chanics, the density matrix of a pure state x is 


P = XX* 




(\ci \ 2 

Cicf\ 

\c 2 cf 

\c 2 \ 2 ) 


subject to the normalization condition (16.86) which requires that 


trace p = 1 


(16.87) 

(16.88) 


According to the probability doctrine of quantum mechanics, its knowledge exhausts 
all that we can find out about the state. 

The expectation value (A) of any operator A is expressed in terms of p as: 


(A) = X f Ax = (c? 



— c i 2 An + CjC* A 2 i + c 2 c*A l2 + c 2 2 A 22 


= trace 


( 


c \ I 


\c 2 c* 


Cl ct\(A n 

I C 2 | 2 / \A 21 



= trace(pA) 


(16.89) 


in accord with Eq. (15.105). 

Exercise 16.17. If A is a Hermitian matrix with eigenspinors u and v, corre- 
sponding to the distinct eigenvalues A[ and A 2 , show that the probability of finding 
A[ in a measurement of A on the state x is given by 

( A' — A \ 

p ^ _ - J = trac eipuu^) (16.90) 

where P A j = uu f represents the projection operator for the eigenvalue A[. 

Like any 2X2 matrix, p can be expanded in terms of the Pauli matrices cr*, 
< T y , o- z , and 1. Since p is Hermitian and its trace equals unity, it can according to 
(16.57) most generally be represented as 


p = - (/ + P • <r) 


(16.91) 


where P x , P y , P z are three real numbers given by 

P x — 2 Re(c*c 2 ) 

P y = 2 Im(ctc 2 ) (16.92) 

^ z =|ci| 2 -|c 2 | 2 

It is immediately verified that p has eigenvalues 0 and 1 . The eigenspinor that cor- 
responds to the latter eigenvalue is x itself, i.e., 


PX = X 


(16.93) 
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The other eigenspinor must be orthogonal to x ■ The matrix p applied to it gives zero. 
Hence, when p is applied to an arbitrary state <p, we have 

P<P = X(.X'<P) (16.94) 

since x is assumed to be normalized to unity. We thus see that p projects <p in the 
“direction” of x ■ It follows that the density matrix is idempotent: 

P 2 = P (16.95) 


Exercise 16.18. Show directly from (16.87) that the density matrix for a pure 
spin state is idempotent and has eigenvalues 0 and 1. 


If (16.91) is required to be idempotent and the identity (16.59) is employed, we 
obtain 


P • P = P 2 X + P 2 y + P 2 Z = P 2 - 1 (16.96) 

Hence, the state is characterized by two independent real parameters, as it should 
be. 

The expectation value of cr x in the state x I s 

11 1 

{cr x ) = trac e(po-J = - trace a x + - P • trace(o-<rJ = - p x • trace(o^) = P x 

where use is made of the fact that trace cr = 0 and cr 2 x — 1 . We get from this and 
analogous equations for cr y and cr z the simple formula 


P = (a) = trace(pa) = trace(crp) 


(16.97) 


proving that P transforms like a vector under rotations. 

Combining (16.91) with (16.93), we find that the spinor x is an eigenspinor of 
the matrix P cr : 

P <t X = X * 06.98) 

Hence, the unit vector P may legitimately be said to point in the direction of the 
particle’s spin. The vector P is also known as the polarization vector of the state. 
It may be characterized by the two spherical coordinates, the polar and azimuthal 
angles, which specify a point on the unit sphere. 


Exercise 16.19. Given a spinor 

X = 


e lct cos 8 




sin 8, 


(16.99) 


calculate the polarization vector P and construct the matrix U R which rotates this 
state into . Prove that the probability p h of finding this particle to be in a state 
represented by the polarization vector n is 


Pa = ^ trac e[p(l + n • a)] = ^ (1 + P • n) 


(16.100) 


and show that this result agrees with expectations for n = P, n = — P, and nlP. 
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Although the language we have used in describing the properties of P refers to 
spin and rotations in ordinary space, the concepts have more general applicability, 
and the formalism allows us to define a “polarization vector’’ corresponding to the 
state of any two-level system. The polarization P is then a “vector” in an abstract 
three-dimensional Euclidean space, and the operator 

eX p(~ ^ ^ 

induces “rotations” in this space. Examples are the isospin space in nuclear physics 
and the abstract polarization vector which can be defined to represent two isolated 
atomic states in interaction with an electromagnetic field as might be generated by 
laser light. 

The formalism is particularly useful to describe the polarization states of elec- 
tromagnetic radiation. Any two “orthogonal” polarization states may be chosen as 
the basis states for the representation, but the two (right and left) circular polariza- 
tion states are usually preferred. The general elliptic polarization state of a light 
wave or photon is a superposition of the basis states in a two-dimensional complex 
vector space, which in optics is known as the Jones vector space. The elements of 
the corresponding density matrix are essentially the Stokes parameters of the polar- 
ization state. The vector P provides yet another representation of the polarization 
state of light. To combat the almost inevitable confusion caused by the double mean- 
ing of the term polarization , in the context of the polarization of light P is best 
referred to as the Poincare vector, in recognition of the mathematician who intro- 
duced the two-sphere as a convenient tool for representing the elliptic polarization 
states. 

The time evolution of the density matrix p can be obtained from the equation 
of motion for X , 

ih = H X X (16.101) 

dt 



where H is assumed to be a Hermitian 2X2 matrix. Using the definition (16.87) of 
the density matrix, we obtain 


dp 

dt 


d X t , d X f 1 „ t 

= T, x + x ^=m Hxx 



or 


ih — — Hp — pH 
dt 


(16.102) 


All of these equations are merely concrete matrix realizations of the general for- 
malism of Sections 14.1 and 15.5. 


Exercise 16.20. Derive the properties of the density matrix that represents a 
stationary state. 

The equation of motion for any expectation value (A) is familiar: 

d{A) 


ih 


= (AH - HA) + ih 


dt 


dA 

dt 


(16.103) 
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It is instructive to derive the equation of motion for the vector P = (o'). To obtain 
a simple formula, it is convenient to represent the Hamiltonian operator H as 

n = \ ( Qoi + Q <r) (16.104) 


where Q 0 and the three components of the vector Q are real numbers, which may 
be functions of time. By (16.103), (16.104), and the spin commutation relations 
summarized in the equation cr X cr = 2iar, we derive 


dP 

dt 


d(a) 1 . . 1 . _ . 

— — = 77 (vH - Her) = — (<rQ ct - Q aw) 
dt in 2in 


= ~ (Q X (fr X = ~Q X (a) 


or 


. dP 

h — = Q X P 
dt 


(16.105) 


Since 


dP 2 d(P ■ P) 


dt 


dt 


= 2P 


dt h 


(Q X P) = 0 


the vector P maintains a constant length. This is merely another way of saying that, 
when the Hamiltonian is Hermitian, the normalization of x is conserved during the 
motion. 

If Q is a constant vector, (16.105) implies that P precesses about Q with a 
constant angular velocity 


w G 



(16.106) 


If 

P(0) = P 0 and Q/Q = Q 

the solution of (16.105) is 

P(0 = 0(Po ' 0) + [P 0 ~ Q(P 0 • Q)] cos oj Q t + Q X P 0 sin w Q t 

= Q(P 0 • Q) + Q x (Po x Q) cos a) Q t + 0 x Po sin (o Q t (16.107) 

A A « CO /I t A 

= P 0 cos a) Q t + 2Q(P 0 • Q) sin 2 + Q X P 0 sin a ) Q t 


Exercise 16.21. Show that if Q is constant, Q • P and ( dP/dt ) 2 are constants 
of the motion. Verify that (16.107) is the solution of (16.105). [See also Eq. (16.63).] 

If Q is a constant vector and the initial polarization P 0 is parallel to Q, it is 
seen from (16.107) and Figure 16.3 that P is constant and equal to 0 or — Q. These 
two vectors represent the two stationary states of the system. Their energies are 
given by the eigenvalues of H, but only the energy difference, A E, is of physical 
interest. Since Q • a has the eigenvalues +1 and — 1, the eigenvalues of H are 
(fio ± 0)/2 and 


A E = Q = h(x> Q 


(16.108) 
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Po-Q(Po-Q) 


Figure 16.3. Precession of the spin polarization vector about Q. The initial polarization 
vector P 0 and Q define the plane of the figure, and the precession angle 9 is the angle 
between P 0 and Q. The Rabi oscillations have the maximum amplitude sin 2 6. 


The probability of P(f) pointing in the direction — P 0 at time t is 


[1 - P„ • P(0] = (P 0 X Q) 2 sin 


2^G , _ 


(Po X Q) 2 


t (16.109) 


If, as indicated in Figure 16.3, we decompose the constant vector Q into two com- 
ponents parallel or longitudinal (Q=) and perpendicular or transverse (Q x ) to the 
initial polarization P 0 , 

Q = Q = + Q x = P 0 (Q • P 0 ) + P 0 X (Q X P 0 ) (16.110) 


(16.109) can be written as 



(16.111) 


where 9 is the angle between the polarization vector and Q (Figure 16.3). Formula 
(16.111) can be interpreted as describing periodic oscillations induced by the trans- 
verse field (Q x ) between two energy levels split by the longitudinal field (Q=) by 
an amount AE 0 = Q = . Generically known as Rabi oscillations, these transitions 
between “spin up” and “spin down” eigenstates states of the unperturbed Hamil- 
tonian, H 0 = (1 + Q= • or)/ 2, are caused by the constant perturbation H — H 0 = 
Q x - cr/2. In the special case of “resonance,” when kE 0 ~ Q= = 0, the maximum 
amplitude of the Rabi oscillations is unity, and the initial state is totally depleted 
whenever a> Q t = (2 n + l)v. We emphasize that this analysis is exact and does not 
rely on any perturbation approximation. 5 


5 For a full discussion of Rabi oscillations, with examples, see Cohen-Tannoudji, Diu, and Laloe 
(1977), Chapter IV. 
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The energy level splitting (16.108) is caused by the entire constant field Q. 
Transitions between the two stationary states can be induced if the spin system is 
exposed to a time-dependent oscillating field that has the same or a similar frequency 
as the spin precession. For example, if a spin 1/2 particle, whose degrees of freedom 
other than the spin can be neglected, is placed in a magnetic field B, the Hamiltonian 
can be written as 


H — fx B = — y — tr • B 


(16.112) 


The quantity y is the gyromagnetic ratio, and the vector Q is given by 


Q = —hy B 

A constant field B 0 causes a precession of P with angular velocity co 0 = — y B 0 . 6 If 
in addition an oscillating magnetic field with the same (or nearly the same) frequency 
is applied, the system will absorb or deliver energy, and the precession motion of P 
will be changed. These general principles are at the basis of all the magnetic reso- 
nance techniques that are so widely used in basic and applied science. 

A special case of an oscillating field, for which a solution of the equation of 
motion can easily be obtained, is that in which the vector Q rotates uniformly about 
a fixed axis. Suppose that id is its angular velocity. It is advantageous to change 
over to a frame of reference which is rotating with the same angular velocity. Viewed 
from the rotating frame of reference, Q is a constant vector. If we denote the time 
rate of change of P with respect to the fixed system by dP/dt, and with respect to 
the rotating system by dP/dt, we have 


dP 

dt 


— + to X P 

dt 


as is well known from the kinematics of rigid bodies; hence, 


h 


dp 

dt 


= (Q — ftw) X P 


(16.113) 


Since in the rotating frame of reference Q — /ito is a constant vector, the problem 
has effectively been reduced to the previous one. Equation (16.113) can therefore 
be solved by transcribing the solution of (16.105) appropriately. 


Exercise 16.22. If Q rotates uniformly about a fixed axis, the equation of 
motion (16.101) may conveniently be transformed to a frame of reference that rotates 
similarly. Derive the new Hamiltonian and show that it corresponds effectively to 
precession about the constant vector Q — hen, providing an independent derivation 
of (16.113). 


Exercise 16.23. If a constant magnetic field B 0 , pointing along the z axis, and 
a field B 1; rotating with angular velocity co in the xy plane, act in concert on a spin 
system (gyromagnetic ratio y ), calculate the polarization vector P as a function of 
time. Assume P to point in the z direction at t = 0. Calculate the Rabi oscillations 
in the rotating frame, and plot the average probability that the particle has “spin 


6 This is the quantum analogue of the classical Larmor precession described in Goldstein (1980), 
Section 5-9. 
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down” as a function of (o/a) 0 for a value of BJBq = 0.1. Show that a resonance 
occurs when <o = — yB 0 . (This arrangement is a model for all magnetic resonance 
experiments.) 

Although so far in this section the density matrix p for spin states was assumed 
to represent a pure spinor state almost every statement and equation involving p 
can be immediately applied to a mixed state, illustrating the general density operator 
theory of Sectioh 15.5. The only exceptions are propositions that assume that p is 
idempotent or that the polarization vector satisfies |P| = 1, since the conditions 
(16.95) or (16.96) are necessary and sufficient for the state to be pure and repre- 
sentable by a spinor x- 

A pure or mixed state is represented by a Hermitian density matrix whose ei- 
genvalues are positive and sum to unity, as required by (16.88). For any density 
matrix the inequality (15.120) holds: 

0 < trace p 2 < (trace p) 2 =1 (16.114) 

In terms of the polarization vector, we have 

p 1 = p + j (P • P - 1) 

If this identity is used in the inequality (16.114), we conclude that generally 
| P | < 1, and that for a proper mixed state, i.e., one that is not a pure state, |P| < 1. 
An unpolarized spin system has p = (1/2)7 and P = 0. In spin-dependent scattering 
processes, which are the subject of the next section, proper mixed states representing 
incident particle beams are the rule rather than the exception. 

7. Polarization and Scattering. The theory of scattering was developed in Chap- 
ter 13, neglecting the spin entirely. However, the forces that cause a beam of par- 
ticles to be scattered may be spin-dependent, and it is then necessary to supplement 
the theory accordingly. The incident particles with spin one-half are represented by 
a wave function of the form 

e ikz Xinc = xLcXinc = 1 

Following the procedure of Chapter 13, we must look for asymptotic solutions of 
the Schrodinger equation which have the form 

e ikr 

e ikz X inc + f(6, <P)~ (16-115) 

r 

but the scattering amplitude f(6, <p) is now a two-component spinor. Spin-dependent 
scattering of a particle occurs, for instance, if the Hamiltonian has the form 

p 2 

77 = ™ + V(r) + W(r)L • cr (16.116) 

2m 

representing a spin-orbit interaction term in addition to a central force. 

The superposition principle — and more specifically, the linearity of the Schro- 
dinger equation — allows us to construct the solution (16.115) from the two particular 
solutions that correspond to ^ inc = a and ^ inc = /3. These two special cases describe 
incident beams that are polarized along the direction of the initial momentum and 
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opposite to it. The polarization is said to be longitudinal. We are thus led to look 
for two solutions of the asymptotic form 

e ikr 

ipi = e ikz a + (S n a + S 21 , 3) — (16.117) 

r 

e ikr 

fa = e ikz p + (S 12 a + S 22 (3) — (16.118) 

r 

rhe quantities in parentheses are the appropriate scattering amplitudes. 

Exercise 16.24. Show that the incident waves e‘ kz a and e lkz fl are eigenstates 
of J z . What are the eigenvalues? 


Multiplying (16.117) by c 1 , and (16.118) by c 2 , and adding the two equations, 
ve obtain by superposition the more general solution 



dere S stands for 2X2 scattering matrix 



Xkr 


r SjXinc 



(16.119) 


(16.120) 


? depends on the angles 6 and <p, and on the momentum k. The scattering problem 
s solved if S can be determined as a function of these variables. 

The form of S can be largely predicted by invariance arguments, although its 
lependence on the scattering angle 6 can be worked out only by a detailed calcu- 
ation, such as a phase shift analysis. Here we will only deduce the general form of 
he scattering matrix. The basic idea is to utilize the obviou.s constants of the motion 
hat the symmetries of the problem generate. If A commutes with the Hamiltonian, 
hen if ip is an eigenfunction of H, Aip is also an eigenfunction of H, and both belong 
o the same energy. The state Aip may represent the same scattering state as i p, or a 
lifferent one of the same energy, depending on the asymptotic form of ip. 

Let us assume that, owing to spherical symmetry of the scattering potential, H 
s invariant under rotations and, according to Section 16.4, commutes with the com- 
>onents of J. Expression (16.116) shows an example of a spin-dependent Hamilto- 
lian with rotational symmetry. The incident waves in (16.117) and (16.118) are 
iigenstates of J z with eigenvalues +h/2 and —hi 2, respectively (Exercise 16.24). 
since the operator J z leaves the radial dependence of the scattered wave unchanged, 
he solutions (16.117) and (16.118) must both be eigenfunctions of J z By requiring 
hat 

T , _/l d 1 \ ft 

JA ' = %Tv + i a 'r' = 2*' 


r , J 1 9 1 \ h 


md 
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it is easily seen that S u and S 2 2 can be functions of 9 only and that the off-diagonal 
elements of the scattering matrix have the form 

S 12 = e~ ,<p X function of 9, S 2l = e l<p X function of 6 (16.121) 


Furthermore, the Hamiltonian H is assumed to be invariant under a reflection 
with respect to any coordinate plane. This is true for the spin-orbit interaction in 
(16.116), because both L and S or n are axial vector operators, and their inner 
product is a scalar operator. The operator for reflection in the yz plane is P x a x , where 
P x simply changes x into —x, and cr x has the effect of changing a spin state in such 
a way that {cr y ) and (a z ) change sign, while { <j x ) remains unchanged, as behooves 
an axial (or pseudo-) vector. (For a more general explanation of reflections and parity 
for systems with spin, see Section 17.9.) 

Since 

(t x ol = /3, o- x f3 = a 

the reflection in the yz plane changes the incident wave e lkz a into e‘ kz l3 and leaves 
e lkr /r invariant. Hence, (16.117) must go over into (16.118). In terms of spherical 
polar coordinates, P x has the effect of changing cp into tt — q>. It follows from this 
and (16.121) that 

Sn = S 22 = g(9), S 21 (~<P, 9) = -S l2 (cp, 9) = -e~^h(9) 


Consequently, we may write 


S - 


( g(e) 

\-h{9)e i(p 


h(9)e~ itp \ 

8 ( 0 ) ) 


— g(9)l + ih(9)(o- y cos cp — <j x sin cp) 


(16.122) 


The unit vector n(— sin cp, cos cp, 0) is normal to the plane of scattering and points 
in the direction of k inc X k scatt . We conclude that the scattering matrix has the form 


S = 8(0)1 + ih(9)h • <r 


(16.123) 


The functions g(9) and h(9) are generalizations for the spin 1/2 case of the scattering 
amplitude f(9) in Chapter 13. For rotationally invariant potentials, they can be par- 
ametrized by a generalization of the phase shift analysis of Section 13.5, but if they 
are to be computed from the assumed interaction, a set of coupled radial Schrodinger 
equations must ultimately be solved. The terminology “spin-flip” amplitude for h(9) 
and “non-spin-flip” for g(9) is self-explanatory. 


Exercise 16.25. Show that the same scattering matrix is obtained by requiring 
reflection symmetry with respect to the xz plane. 


Knowing S, we can calculate the intensity of the beam for a given direction. If 
(16.119) is the asymptotic form of the wave function, then by a straightforward 
generalization of the results of Chapter 13, the differential scattering cross section 
is found to be 


^ = (5'Tmc) t S / ymc = xU'Sxt.c (16.124) 

which is merely the analogue of |/(0) | 2 for a particle with spin. If the density matrix 
p inc describes the state of spin polarization of the incident beam, whether the state 
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be pure or — as is frequently the case — mixed, this expression may be generalized 
to 


da . 

— = trace (p inc S + .S) 


(16.125) 


Since 5^ inc is the state of the particles scattered from an incident spin state ;f inc into 
the specified direction, the density matrix corresponding to the scattered part of the 
wave function is 


Pscatt 


Spi 


JS' 


= Sp inc S f 
txace(Sp inc S f ) da/dd 

Using the form (16.123) for the scattering matrix and 


Pine = “ (1 + P 0 • O’) 


(16.126) 


(16.127) 


for the incident density matrix, we obtain the differential cross section in terms of 
;he polarization P 0 of the incident beam: 


da 

dtt 


g\ 2 + \h\ 2 + i(g*h - gh*) P 0 • n 


The polarization of the scattered beam is 


P 


(a) = trace p scatt a 


trace(5p inc 5 t or) 

da/dCl 


(16.128) 


(16.129) 


jf we use (16.123) to evaluate the trace, we obtain 

Qg| 2 ~ \h\ 2 )~P 0 + i(g*h - gh*)h + 2 1 h [ 2 P 0 n n + (g*h + gh*) P 0 X n 


P = 


g | 2 + \h\ 2 + Kg*h - gh*) P 0 • n 


(16.130) 


f the initial beam has transverse polarization and the scattering plane is chosen 
)erpendicular to P 0 , or P 0 = P 0 fi, it follows from (16.130) that 


(jgl 2 + \h\ 2 )P 0 + i(g*h - gh*) 
|g| 2 + \h\ 2 + i(g*h - gh*)P 0 


(16.131) 


f the incident beam is unpolarized, P 0 = 0, the scattered beam is polarized normal 
o the scattering plane: 


P = Ph = 


. g*h 


gh* „ 

n 


|g| 2 + l^l 


(16.132) 


Exercise 16.26. Show that if the incident spin state is a pure transverse po- 
arization state, the scattering amplitudes for the initial polarizations P 0 = ±n are 
r ± ih and the scattering leaves the polarization unchanged, P - P 0 . 

Exercise 16.27. Show that the magnitude of the polarization given by (16.132) 
atisfies the condition 1 > ]P[ > 0. Hint: Consider |g - ih\. 


If the y axis is chosen to be along the direction of the transverse component of 
he polarization, P 0 - P 0 • k k, we may write P 0 • n = | P 0 - P • k k | cos <p. With 
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these conventions, formula (16.128) for the differential cross section shows that the 
scattered intensity depends on the azimuthal angle as I — a(6) + b(6) cos <p, in 
agreement with the empirical statement (16.7) in Section 16.1. In this way, we find 
substantiated our original supposition that the right-left asymmetry in the scattering 
of polarized beams of particles is a consequence of the particle spin. 

Exercise 16.28. Assuming that P 0 is perpendicular to the scattering plane, 
evaluate the asymmetry parameter A, defined as a measure of the right-left asym- 
metry by 

(da/di 1) + - (dam)- 

(do/dSl) + + ( dam )_ 1 ' j 

where the subscripts + and — refer to the sign of the product P 0 • n. Show that if 
P 0 = ±n, the asymmetry A equals the degree of polarization P defined in (16.132). 
In particle polarization experiments, this quantity is referred to as the analyzing 
power. 


8. Measurements, Probabilities, and Information. The spin formalism is so easy 
to survey that it lends itself particularly well to a demonstration of how quantum 
mechanics is to be interpreted, and how it relates to experiment, observation, and 
measurement. By using the 2 X 2 density matrix formalism to represent an arbitrary 
mixed spin state, we will be able to keep the discussion as general as possible. We 
assume that the spin state of the system is entirely specified by the density matrix 
p. Illustrating the general concepts of Section 15.5, we ask what kinds of ensembles 
might represent a known p, and what observables might be measured to determine 
an unknown p. 

It is again convenient to represent the density matrix by the real-valued polar- 
ization three-vector P, such that 

p =i (1 + P • a) (16.134) 

£ 


Its eigenstates are represented by the pure-state density matrices: 


p* = j y + p 


a) and p_ = - (7 - P • tr) 


(16.135) 


which correspond to eigenvalues p+ = (1 + P)/2 and p_ = (1 — P)/2. The von 
Neumann entropy for this density matrix is, according to (15.128), 


S(p) ~ ~P+ In p~ ~ p~ In p_ 

= In 2 - | [(1 + P) ln(l + P) + (1 - P) ln(l - P)] 


(16.136) 


The given density matrix may be realized by any ensemble of N pure states with 
polarization vectors P, and probabilities p { with ^ Pi = 1 > such that 


N 


p = X 


Pi Pi 


i— 1 


The Shannon mixing entropy (15.126) of this ensemble is 

N 

H(%) = Pi In Pi 


(16.137) 


(16.138) 
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Exercise 16.29. As an example consider the 2 X 2 density matrix defined by 
he polarization vector 

P = \ (* + y + z) (16.139) 

tnd realized by an ensemble % of the N = 3 equiprobable pure states that correspond 
;o the spin pointing in the directions of the Cartesian coordinate vectors. For this 
nixed state, compute and compare the Shannon mixing entropy, H{%), and the von 
Neumann entropy, S{p). 


We now consider the measurement of the observable <t • n, which corresponds 
to projection operators (POM) 

A + = — (7 + cr • n) and A_ — - (7 — cr • n) (16.140) 

The probability that the system is found with spin in the direction n is the expectation 
value of the projection operator for the eigenstate of or ■ n: 


Pt> = ^ trace[p(7 + n • cr)] 

1 

= ^ t 1 + (Pn “ P 22 ) n z + p l2 (n x + in y ) + p* 2 (n x - in y )] 


(16.141) 


subject to the normalization condition 

trace p = p n + p 2 2 = 1 (16.142) 

In terms of the polarization vector (Exercise 16.19), 

p fi = j trace[(7 + P • cr)(7 + n • cr)] = \ (1 + P • ft) (16.143) 

If three linearly independent observables cr ■ n l5 cr • n 2 , ct • n 3 are measured, using 
ensembles with the same density operator (although, of course, not the same parti- 
cle), the matrix elements of p can be determined. This is similar to the description 
of the polarization state of a beam of light, which generally requires the measurement 
of the three Stokes parameters for its determination. 

For example, the only possible results of a measurement of cr z (or of any other 
component of cr) are +1 or — 1. By measuring cr z for a very large number of replicas 
of the system, all prepared in the same state, we can determine the density matrix 
element p u = (1 + P z )/ 2 = p z , which represents the relative frequency of finding 
“spin up” in the z direction. Other choices of the direction n provide more infor- 
mation about the magnitudes and phases of the density matrix elements. 

The outcome entropy for a measurement of o- • n is, according to (15.131), 



1 r 1 

1 [l 

77(<r • 1 

i) = -- (1 + P • fi) In - (1 + P • n 

) - - (1 - P • n) In - (1 - P • n) 



(16.144) 


As expected from Eq. (15.131), this entropy reaches its minimum value, the von 
Neumann entropy (16.136), when the measured spin points in the direction of the 
polarization vector: n = P. Figure 16.4 shows how the Shannon entropy for the 
outcome of the measurement of cr • n depends on P • n. 
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H(cr ■ n) 



Figure 16.4. Outcome entropy H(tj ■ ft) for a measurement of tr • ft as a function of P • ft. 

Exercise 16.30. For the state specified by the polarization vector (16.139), 
calculate the Shannon entropy, H(a • n), for the outcome of a measurement of <r • n, 
with n pointing along any one of the three coordinate axes. Compare the answer 
with the value of the von Neumann entropy of the state. 

Exercise 16.31. If p represents the pure state, 

X = c 1 a + c 2 /3 = \c 1 \e ,yi a + \c 2 \e iy2 P (16.145) 

and if n is a unit vector in the yz plane making an angle 0 with the z axis and 
90° — 6 with the y axis, show that the probability for a • n to yield the value + 1 is 

0 0 

Pa = ki| 2 cos 2 - + |c 2 | 2 sin 2 — — |c t ] | c 2 1 sin(yj - y 2 ) sin 0 (16.146) 

Similarly, the probability for the value — 1 is given by 

0 d 

P-a = | Ci | 2 sin 2 - + |c 2 | 2 cos 2 - + | c z 1 1 c 2 1 sin^ - y 2 ) sin 0 (16.147) 

Exercise 16.32. Write down the density matrix that represents the pure spin 

state 



and compare this with the density matrix for the mixed state about which we only 
know that the probability of “spin up” is one-third, and the probability of “spin 
down” is two-thirds. Calculate the von Neumann entropy for these two states. 

Exercise 16.33. For a mixed state given by the density matrix 

W 41 7 + 7A 

75 \7 - li ■* 34 ) 

check the inequalities (15.120), and calculate the eigenvalues and eigenstates. Eval- 
uate the von Neumann entropy, and compare this with the outcome entropy for a 
measurement of cr z . 
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Chapter 16 The Spin 


A molecular beam experiment of the Stern-Gerlach type has traditionally been 
sgarded as the prototype of a measurement, fundamental to a proper understanding 
f quantum mechanics. When, as depicted in Figure 16.1, the z component of the 
pin is measured, there is a bodily separation of the particles that the experimenter 
ubjects to the question, “Is the spin up or down?” The beam splits into two com- 
onents made up, respectively, of those particles that respond with “up” or with 
‘down” to this experimental question. 

Before the particle interacts with the measuring apparatus, the preparation of 
:s state is assumed to introduce no correlations between the spin and space degrees 
f freedom. Thus, initially the state has the simple product form 

Pinitial = Pspin ® P(f, **') (16.148) 

/here p spin denotes the spin state and p(r, r') the purely spatial part of the density 
latrix. The probabilities for “spin up” and “spin down” in this state are 

P±n = ~ trac e[p spin (i ± n • cr)] 

The interaction with the Stern-Gerlach magnet causes the product state (16.148) 
3 change into a more complicated correlated, or entangled, state. A careful analysis 
hows that in the region near the magnet where the two beams are well separated, 
he state of the particles can be represented as 

Pfmai = Pa \ (1 + n • or) 0 p up (r, r') + p_ fl ^ (i - n ■ o-) 0 p dow „(r, r') 

(16.149) 

n this expression, p up (r, r') and p down ( r, r') are spatial density matrices that de- 
cribe the two separated particle beams. Usually, these spatial density matrices can 
»e approximated in terms of wave packets moving along classical trajectories. The 
lesign of the apparatus ensures that they do not overlap and differ from zero only 
n the region traversed by the upper or lower beam, respectively. The upper com- 
ionent p up (r, r') is said to be correlated with the spin state in which cr • n is +1, 
nd the down component p down (r, r') is correlated with the spin state in which cr • n 
s — 1. In the measurement, a particle reveals a spin “up” or “down” with proba- 
•ilities equal to p +& and p _ ft . If by some ingenious manipulation the two separated 
'earns are recombined, additional terms must be included in (16.149) to account for 
he phase relations in the spin density matrix, which are lost if only the separated 
ieams are considered. 

In this connection, it is interesting to give some thought to a multiple Stern- 
lerlach experiment in which two or more spin measurements are carried out in 
eries. Let us assume again that a z is measured in the first experiment. If in the 
econd experiment ar z is remeasured, we will find that every particle in the upper 
learn has spin up, and every particle in the lower beam has spin down. Neither beam 
s split any further, confirming merely that conceptually the ideal Stern-Gerlach 
xperiment is an exceptionally simple kind of measurement. Although it can change 
he state profoundly, from (16.148) to (16.149), this particular measurement does 
tot alter the statistical distribution of the measured quantity (a z ), nor does the spin 
tate change between measurements. If in the second measurement the inhomoge- 
leous magnetic field has a different direction, and thus a different component of the 
pin, say a y , is measured, we will find that each beam is split into two components 
>f equal intensity, corresponding to the values +1 and —1 for cr y (Figure 16.5). 

This example shows the unavoidable effect which a measurement has on the 
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■P-*! = 2 trace b>G- "l ' ^ 

Figure 16.5. Successive Stern-Gerlach measurements of the spin projections fii • a, n 2 • a, 
n! • <r, producing pure “spin up” and “spin down” states. Each box symbolizes a beam 
splitting. The spin state of the incident beam is represented by the density matrix p. For 
each beam segment the spin component of the density matrix is specified. If rfi # ±n 2 , the 
second beam splitter regenerates the “spin down” polarization state for direction rfi from 
particles that entered it entirely with “spin up” along direction flj. 


system upon which the measurement is carried out. If p (short for p spin ) is the spin 
state before the measurement, and cr y , rather than cr z , is measured in a first experi- 
ment, then according to (16.134) the probability of finding +1 is 

1 1 i 

Py = 2 trace ^^ + °V)] = 2 + 2 ^ Pl2 “ p * 2 ^ (16.150) 

whereas, if we precede this cr y measurement by a measurement of a z , the probability 
of finding a y to be +1 is simply pn/2 + p 22 /2 = (pn + P 22 V 2 = 1/2, in accordance 
with the common rule of compounding conditional probabilities. The probability p y 
in (16.150) differs from Pn/2 ■f P 22/2 by an (off-diagonal) interference term, ^vhich 
the intervening cr z measurement must wipe out if the probability interpretation of 
quantum mechanics is to be consistent. If in a third successive Stern-Gerlach mea- 
surement <j z is measured again (Figure 16.5), we find anew a splitting of the beam, 
showing that the intervening measurement of a y has undone what the first cr z mea- 
surement had accomplished. In the language of particle physics, we may say that 
the cr y measurement has regenerated an amplitude for cr z with value — 1 in the branch 
in which the first measurement of a z had produced a pure + 1 spin state. 

In an ideal arrangement of this kind, two observables A and B are termed com- 
patible if for any state of the system the results of a measurement of A are the same, 
whether or not a measurement of B precedes that of A. In other words, A and B are 
compatible if measuring B does not destroy the result of the determination of A. 
Clearly, this can happen only if the eigenstates of A are simultaneously also eigen- 
states of B, According to the arguments presented in Section 10.4, the necessary and 
sufficient condition for this is that the matrices representing A and B commute: 

AB - BA = 0 

Two observables are compatible if and only if the Hermitian matrices representing 
them commute. 

For example, cr z and cr y are incompatible, for they do not commute; a state 
cannot simultaneously have a definite value of cr z and ay If we wish to measure a z 
and cr y for a state p, two separate copies of the system must be used. The two 
components of the spin cannot be measured simultaneously on the same sample. 
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A measurement of the simple kind described by the initial state (16.148) and 
e final correlated state (16.149) is an example of an ideal measurement (sometimes 
died a measurement of the first kind ) because the spatial separation of the two spin 
jmponents allows the unambiguous identification of the two spin states and the 
‘impulsive”) measuring interaction leaves the two spatially separated spin states 
ttact. If we consider the spin properties of the particles in isolation, the Stern- 
erlach device may be regarded as a spin filter that allocates fractions p +A and p_ fl 
f the particles definitely to the pure spin states represented by the density matrices 
' + n • cr)/2 and (I — n • tr)/2, respectively. A correlated or entangled state like 
16.149), in which the various eigenstate projections of the dynamical variable that 
being measured are prevented from interfering after the measurement, is some- 
mes loosely interpreted by saying that the act of measurement “puts” the system 
ito an eigenstate. 

The acquisition of information provided by the measurement and the subsequent 
jplacement of the original correlated state by selection of one or the other of its 
omponents with definite probabilities is conventionally referred to as the reduction 
f the state. In the spirit of the statistical interpretation of quantum mechanics, the 
jduction of the state — also known more dramatically as the collapse of the wave 
acket — ns not meant to describe a physical process that affects the (probability) 
mplitudes by actual measurement manipulations. Only with this proviso is it de- 
msible to say that after the reduction has taken place in an ideal measurement, the 
ystem has a definite value of the observable, namely, the eigenvalue determined by 
re measurement. A repetition of the measurement of the same quantity in the new 
tate will now yield with certainty this very eigenvalue. 

While the idealized Stern-Gerlach experiment illustrates many salient issues in 
uantum mechanics, the great variety of actual experiments defies any effort to clas- 
ify all measurements systematically. Most measurements are more difficult to an- 
lyze, but for an understanding of the physical significance of quantum states it is 
ufficient to consider the simplest kind. 

In the persistent debate about the foundations of quantum mechanics and the 
|uantum theory of measurement, we take the position that'the assignment of prob- 
bilities to the outcomes of various possible tests, acquired through experimental 
evidence, inspired guesswork, or other inferential procedures, is an indispensable 
>art of the specification of a quantum system. In particular, in this view there is no 
eason to draw a line and make a qualitative distinction between a probability-free 
‘objective” physical reality and the “subjective” realm of the observer who uses 
>robabilities for interpreting the data. Rather, we regard the acquisition of infor- 
nation, and its evaluation in terms of probabilities, as an integral part of a full 
lescription of the physical system and its evolution. 


°roblems 

L. The spin-zero neutral kaon is a system with two basis states, the eigenstates of cr z , 
representing a particle K° and its antiparticle K°: The operator a x — CP represents 
the combined parity ( P ) and charge conjugation (C), or particle-antiparticle, trans- 
formation and takes a = |j£°) into j8 = | K°). The dynamics is governed by the Ham- 
iltonian matrix 


r 

H = M - I - 
2 
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where M and T are Hermitian 2X2 matrices, representing the mass-energy and decay 
properties of the system, respectively. 7 The matrix T is positive definite. A funda- 
mental symmetry (under the combined CP and time reversal transformations) requires 
that a x M* — Ma x and cr^T* = Y<r x . 

(a) Show that in the expansion of H in terms of the Pauli matrices, the matrix 
cr z is absent. Derive the eigenvalues and eigenstates of H in terms of the matrix 
elements of M and T. Are the eigenstates orthogonal? 

(b) Assuming, as is the case to good approximation, that the Hamiltonian also 
satisfies the CP invariance conditions cr x M = Ma x and <j x T = Ta x show that H is 
normal, and construct its eigenstates, |AT?) and \K 2 ). If the measured lifetimes for 
these two decaying states are r i — h/Ti - 0.9 X 10~ 10 sec and r 2 = h/T 2 = 0.5 X 
10 -7 sec, respectively, and if their mass difference is m 2 — m x — 3.5 X 10 -6 eV/c 2 , 
determine the numerical values of the matrix elements of M and T as far as possible. 

(c) If the kaon is produced in the state K° at t = 0, calculate the probability of 
findingjt still to be a AT 0 at a later time t. What is the probability that it will be found 
in the K° state? Plot these probabilities, exhibiting particle-antiparticle oscillations, 
as a function of time. 


7 See Perkins (1982) for experimental information on neutral kaons. 



CHAPTER 17 


dotations and Other Symmetry Operations 


Although symmetry arguments have already been used in almost every 
chapter, here we begin a systematic examination of the fundamental 
symmetries in quantum mechanics. The concepts are best understood by 
thinking about a concrete example. Rotations exhibit the interesting 
properties of many symmetry operations, and yet their theory is simple 
enough to keep the general features from being obscured by too much 
technical detail. If the theory of rotations is to be transferrable to other 
symmetries, it must be seen in the more abstract context of symmetry 
groups and their matrix representations. Much of the chapter is devoted to 
the practical problems of adding angular momenta and the extraction of 
symmetry-related properties of matrix elements of physical observables. 

In the last two sections, we deal with discrete symmetries (space 
reflection and time reversal) and their physical implications, and we return 
briefly to local gauge symmetries, which are distinctly different from 
global geometric symmetries. 


. The Euclidean Principle of Relativity and State Vector Transformations. The 
undamental assumption underlying all applications of quantum mechanics is that 
•rdinary space is subject to the laws of Euclidean geometry and that it is physically 
lomogeneous and isotropic. By this we mean that we can move our entire physical 
ipparatus from one place to another and we can change its orientation in space 
vithout affecting the outcome of any experiment. We say that there is no preferred 
>osition or orientation in space. The assumption that space is homogeneous and 
sotropic will be called the Euclidean principle of relativity because it denies that 
ipatial location and orientation have any absolute significance. 

Gravity seems at first sight to introduce inevitably a preferred direction, the 
vertical, into any experiment performed on or near the surface of the earth, but in 
luantum physics we are concerned primarily with atomic, nuclear, and particle pro- 
cesses in which gravitational effects play a negligible role. The apparent anisotropy 
)f space can then usually be ignored, and the isotropy of space for such quantum 
Processes can be tested directly by rotating the system at any desired angle. If grav- 
tation cannot be neglected, as in some extremely sensitive neutron interferometry 
measurements , 1 there is again no conflict with the Euclidean principle of relativity, 
because we can imagine the earth to be part of the mechanical system and take its 
gravitational field into account when a rotation is performed. No violation of the 
Euclidean principle of relativity has ever been found in any laboratory experiment. 

On a grander, astronomical and cosmological scale there are legitimate serious 
questions about the validity of the principle. Understanding the physics of the very 
;arly universe may require a fully developed theory that unites gravity with quantum 
mechanics. The scale at which quantum gravity is expected to be influential, called 


Werner (1994). 



1 The Euclidean Principle of Relativity and State Vector Transformations 


411 


the Planck scale, is characterized, on purely dimensional grounds, by the Planck 
mass, 

M P c 2 = « 10 28 eV = 10 16 TeV 

The corresponding Planck length is of the order of 10 -35 m, and the Planck time is 
10~ 43 sec. These estimates make it clear why we will not be concerned with gravity. 

Here we focus on the remarkable consequences that the Euclidean principle of 
relativity and its extension to the time dimension have for the structure of quantum 
mechanics. We will find that this principle severely restricts the possible forms that 
the quantum description of a given system can take. 

A transformation that leaves the mutual relations of the physically relevant 
aspects of a systetn unaltered is said to be a symmetry operation. The Euclidean 
principle of relativity amounts to the assumption that geometric translations and 
rotations are symmetry operations. We first concentrate on rotations about an axis 
or a point, but in Section 17.9 we will extend the discussion to reflections. Nonrel- 
ativistic Galilean symmetry, involving transformations that include motions in time, 
was discussed in Section 4.7. The symmetry operations associated with the Einstein 
principle of relativity are based on Lorentz or Poincare transformations and will be 
taken up in Chapters 23 and 24. 

When a quantum system with a state vector SP is rotated in space to a new 
orientation, the state vector changes to 'T. The Euclidean principle of relativity 
requires that under rotation all probabilities be invariant, i.e., all inner products of 
two rotated states remain invariant in absolute value. We thus have a mapping of 
the vector space onto itself, M r 1 J f ', such that |0T, d>')| 2 = |(ty, <E>)] 2 for every 
pair of state vectors. Such a mapping is called an isometry. The mapping must be 
reversible, because we could equally well have started from the new orientation and 
rotated the system back to its old orientation. In the language of Section 16.3, we 
are considering active rotations. 

Generally, we do not require invariance of inner products, which is the hallmark 
of unitary transformations, but only that the absolute values be invariant. Yet be- 
cause of a remarkable theorem, we will ultimately be able to confine our attention 
essentially to unitary and antiunitary transformations. The reasoning given here ap- 
plies to any symmetry operation and not just to rotations. 

Theorem. If a mapping M* i-* of the vector space onto itself is given such that 

lor, o')l 2 = | op, o)| 2 (i7.i) 

then a second isometric mapping ’T W, which is merely a phase change of every 
vector, 

V" = e (17.2) 


can be found such that 


<sr = v a + v b 


is mapped into 


+ VI 
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For the proof of this theorem the reader is referred to the literature. 2 The the- 
orem shows that through rephasing of all vectors we can achieve a mapping that has 
me of the two fundamental properties of a linear operator: The transform of the sum 
af two vectors is equal to the sum of the transforms of the two vectors [see (9.43)]. 
[t follows from this result and from (17.1) that 

| OF", V’ + VI) I 2 = \(V a , V a + V b )\ 2 
Hence, by applying (17.1) again, 

(VI VI) + OF", = (V a , V b ) + (V a , V b )* 


ar 


Re(^", VI) = Re(V a , V b ) 

Since the absolute value of the inner product (^ a , M*),) is invariant, we must have 

ImOR", ¥£) = ±lm(V a , V b ) 

The 4- sign implies that 

(VI VI) = (' V a , %) (17.3) 

and 

(A^)" - XV" (17.4) 

whereas the — sign implies that 

(VI VI) = ( V a , %)* (17.5) 

and 

(XV)" = X*V" (17.6) 

Equation (17.4) expresses the second fundamental property of a linear operator [see 
(9.44)], and from condition (17.3) we infer that in the first case the transformation 
is unitary. Equation (17.6), on the other hand, characterizes an antilinear operator 
[see Eq. (9.46)]. 

It is easy to see the profound implications of this theorem. State vectors that 
differ by phase factors represent the same state, and a rephasing transformation has 
no physical significance. It follows that in studying the symmetry operations of a 
physical system we may confine ourselves to two simple transformations — those 
that are linear and those that are antilinear. Any more general symmetry transfor- 
mation can be supplemented by a phase change and made to fall into one of these 
two fundamental categories, which are mutually exclusive. Note that the rephasing 
operation is generally not unitary because different state vectors are generally mul- 
tiplied by different phase factors. 

If the symmetry operation is a rotation, the antilinear case is excluded as a 
possibility because rotations can be generated continuously from the identity oper- 
ation, which is inconsistent with complex conjugation of a multiplier. Antilinear 
transformations are important in describing the behavior of a system under time 
reversal, a topic to which we will return in Section 17.9. 

2 See Wigner (1959), Appendix to Chapter 20, p. 233; see also Bargmann (1964). 
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2. The Rotation Operator, Angular Momentum, and Conservation Laws. The 
result of the last section is that, if the Euclidean principle of relativity holds, rota- 
tions in quantum mechanics are represented by unitary transformations, validating 
the assumption made in Section 16.3. Although the discussion in Section 16.4 was 
phrased in terms of spinors describing the state of a spin one-half system, the for- 
malism of rotation operators (or matrices) was in no way dependent on the special 
nature of the system. The unitary operator that in three-dimensional space rotates a 
state |^) into j 1 ^ 4 ) = E/*|‘'I r ) has the form 

U R = ex pf - ^ (17.7) 


and the Hermitian generators of rotation, J, must satisfy the commutation relations 
(16.44): 


[J,, Jj ] ihe iJk d h or s ihJk 


(17.8) 


Since the trace of J] vanishes, the operators (17.7) are unimodular. We know from 
Chapters 11 and 16 that orbital angular momentum operators L = r X p and spin 
angular momentum operators S satisfy the commutation relations (17.8). They are 
realizations of the generic angular momentum operator J. Planck’s constant h was 
introduced into the definition of the rotation operator in anticipation of the identi- 
fication of the operators J as angular momentum for the system on whose states 
these operators act. 

In Section 11.2, we determined the eigenvalues and eigenvectors for all the 
Hermitian operators J that satisfy the commutation relations (17.8), as well as the 
matrices that represent the generalized angular momentum. We now make use of the 
results obtained there. The eigenvalues of any component of J, such as J z , are mh, 
and the eigenvalues of J 2 are j(j + 1)/L 2 . The quantum number j takes on the values 
j = nonnegative integer or half-integer, and m is correspondingly integer or half- 
integer subject to the restriction — j < m < j. Suppressing any other relevant infor- 
mation that characterizes the state, we denote the eigenvectors by \jm). Since all 
nonnegative integers are expressible as 2 j + 1, the angular momentum algebra can 
be realized in a vector subspace of any number of dimensions. 

In constructing the rotation operator explicitly, we must take into account a 
further condition that arises because the same rotation R is represented by all op- 
erators of the form 


U R = exp 


~~t ■ J(0 + 277*) 
n 


= exp( — j n • Jd> ) exp( — — n • J27 t* 


h 


h 


where k is an arbitrary integer. The factor exp(-277*m • J lh) is a unimodular op- 
erator whose effect on an eigenstate \jm) of n • J is simply to multiply it by 
exp (—imlirk) = ( — \) 2km . This is +1 for integer km and —1 for half-integer km. If 
a physical state were represented by a superposition of angular momentum eigen- 
vectors with both integral and half-integral values of j, then since the components 
with integer j (and m) would remain unchanged while the components with half- 
integral j (and m) would change sign, application of the rotation operator 
exp(-277*in • J lh) with k = odd would produce an entirely different state vector. 
Yet, for systems of point particles such a rotation is geometrically and physically 
equivalent to no rotation at all and behaves like the identity. In other words, the 
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nathematical framework allows for state vectors that have no counterpart in physical 
eality. In ordinary quantum mechanics these states are declared inadmissible by the 
mposition of a principle. This superselection rule has dramatic physical consc- 
iences: for instance, particles of half-integral spin cannot be created or destroyed 
ingly, or more generally, in odd numbers (because of the way angular momenta 
idd; see Section 17.5 as well as Section 24.5). 

The general theory of angular momentum presents us with all the possible ways 
n which state vectors may transform under rotation in three dimensions. It does not, 
>f course tell us which of these possibilities are realized in nature. We have already 
^countered two actual and important examples: the orbital angular momentum, 
1 = L, and the spin angular momentum, J = S = ha l 2, of electrons, protons, 
leutrons, quarks, and so on. Both of these vector operators satisfy the commutation 
elations for angular momentum. They correspond to the values j = l — 0, 1, 2, . . . 
ind j = s = 1/2, respectively. Generalizing these notions, we now identify as an- 
gular momentum any observable that is represented by a generator J (in units of h) 
>f infinitesimal rotations. In order to apply the theory, we must know something 
ibout the nature of the particular physical system under consideration. We must 
enow the observables that describe it and how they behave under rotation. Thus, in 
he case of orbital angular momentum (Chapter 11), we were dealing with the trans- 
formation of a function i/> of the position coordinates x, y, z, or r, <p, 9, and we were 
ed to the study of spherical harmonics. In the case of the spin (Chapter 16), we 
ieduced the behavior of two-component spinors under rotation from the physical 
connection between the intrinsic angular momentum and magnetic moment, and 
Tom the vectorial character of these quantities. Other, more complex examples of 
ingular momentum will appear shortly. 

It should be stressed that three-dimensional Euclidean space, with its own to- 
cology, underlies the theory with which we are concerned here. Quantum systems 
cf lower or higher dimensionality may require qualitatively different treatments. For 
example, a system that is confined to two dimensions may have structural charac- 
teristics that allow a physical distinction between 2nrk rotations — the integer winding 
number k being an appropriate index. Even in three dimensions, we can conceive of 
idealized physical systems other than point particles (e.g., perfect solid or rigid 
bodies) for which it is meaningful to distinguish between odd and even multiples of 
277 rotations. 3 We will return to this point in Section 17.4. 

A symmetry principle like the Euclidean principle of relativity not only circum- 
scribes the geometric structure of quantum mechanics, but also has important dy- 
namical consequences, notably certain conservation laws, by which the theory can 
be tested experimentally. Although the same ideas are applicable to almost any sym- 
metry property, to be explicit we will frame the discussion in terms of conservation 
of angular momentum which is a result of rotational invariance. 

We assume that the dynamical system under consideration is characterized by 
a time-independent Hamiltonian, and in the Schrodinger picture evolves in time from 
its initial state |^(0)) according to the equation of motion, 

ih j t |¥(f)> = H(a)\V(t)) (17.9) 

The Hamiltonian depends on the dynamical variables that describe the system, and 
by adding the parameter a we have explicitly allowed for the possibility that the 


3 Merzbacher (1962). 



2 The Rotation Operator, Angular Momentum, and Conservation Laws 


415 


system may be acted upon by external forces, constraints, or other agents that are 
not part of the system itself. The division of the world into “the system” and “the 
environment” in which the system finds itself is certainly arbitrary. But if the cut 
between the system and its surroundings is made suitably, the system may often be 
described to a highly accurate approximation by neglecting its back action on the 
“rest of the world.” It is in this spirit that the parameter a symbolizes the external 
fields acting on what, by an arbitrary but appropriate choice, we have delineated as 
the dynamical system under consideration. 

We have seen that a rotation induces a unitary transformation U R of the state 
vector of the system. If we insist that the external fields and constraints also partic- 
ipate in the same rotation, a new Hamiltonian H(a R ) is generated. The Euclidean 
principle of relativity in conjunction with the principle of causality asserts that, if 
the dynamical system and the external fields acting on it are rotated together, the 
two arrangements obtained from each other by rotation must be equivalent at all 
times, and U R | M/XO) must be a solution of 

ih j C/*|TO> = //(«*)£/* |^(0> (17.10) 

at 

If the symmetry transformation itself is time-independent, comparison of (17.10) 
with (17.9) yields the important connection 

H{a R )U R = U R H(a ) (17.11) 

If the symmetry transformation is time-dependent, compatibility of Eqs. (17.9) and 

(17.10) requires 

c)Tl 

H(a R ) = U R H(a)U R + ih —f t/J (17.12) 

ot 

It frequently happens that the effect of the external parameters on the system 
is invariant under rotation. In mathematical terms, we then have the equality 

H(a R ) = H(a) (17.13) 

Hence, if the symmetry operator is time-independent, U R commutes with H. Since, 
according to (17.7), U R is a function of the Hermitian operator J, the latter becomes 
a constant of the motion, as defined in Chapter 14. Indeed, the present discussion 
parallels that of Section 14.4, where the connection between invariance properties 
and conservation laws was discussed in general terms. Conservation of angular mo- 
mentum is thus seen to be a direct consequence of invariance under all rotations. 

As an important special case, the condition (17.13) obviously applies to an 
isolated system, which does not depend on any external parameters. We thus see 
that the isotropy of space, as expressed by the Euclidean principle of relativity, 
requires that the total angular momentum J of an isolated system be a constant of 
the motion. 

Frequently, certain parts and variables of a system can be subjected separately 
and independently to a rotation. For example, the spin of a particle can be rotated 
independently of its position coordinates. In the formalism, this independence ap- 
pears as the mutual commutivity of the operators S and L which describe rotations 
of the spin and position coordinates, respectively. If the Hamiltonian is such that no 
correlations are introduced between these two kinds of variables as the system 
evolves, then they may be regarded as dynamical variables of two separate subsys- 
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terns. In this case, invariance under rotation implies that both S and L commute 
separately with the Hamiltonian and that each is a constant of the motion. The 
nonrelativistic Hamiltonian of a particle with spin moving in a central-force field 
couples L with S, and as we saw in Section 16.4, includes a spin-orbit interaction 
term proportional to L • S : 

p 1 2 3 

H — ~ — b V(r) + W(r)L • S 
2m 

If the L • S term, which correlates spin and orbital motion, can be neglected in 
a first approximation, the zero-order Hamiltonian commutes with both L and S, and 
both of these are thus approximate constants of the motion. However, only the total 
angular momentum, J = L + S, is rigorously conserved by the full Hamiltonian. 
We will see in Chapter 24 that in the relativistic theory of the electron even the free 
particle Hamiltonian does not commute with L or S. 


Exercise 17.1. Discuss the rotational symmetry properties of a two-particle 
system, with its Hamiltonian, 


H= + Vflr, - r 2 |) 

2 fTl\ 


(See Section 15.4.) Recall the expression (15.95) for the total angular momentum 
in terms of relative coordinates and the coordinates of the center of mass, and show 
that if the reduced mass is used, the standard treatment of the central-force problem 
in Chapters 11 and 12 properly accounts for the exchange of angular momentum 
between the two particles. 


Exercise 17.2. How much rotational symmetry does a system possess, which 
contains a spinless charged particle moving in a central field and a uniform static 
magnetic field? What observable is conserved? 

3. Symmetry Groups and Group Representations. Because of the paramount im- 
portance of rotational symmetry, the preceding sections of this chapter were devoted 
to a study of rotations in quantum mechanics. However, rotations are but one type 
of many symmetry operations that play a role in physics. It is worthwhile to intro- 
duce the general notion of a group in this section, because symmetry operations are 
usually elements of certain groups, and group theory classifies and analyzes system- 
atically a multitude of different symmetries that appear in nature. 

A group is a set of distinct elements a, b, c , . . . , subject to the following four 
postulates: 

1. To each ordered pair of elements a , b, of the group belongs a product ba 
(usually not equal to ab ), which is also an element of the group. We say that the 
law of group multiplication or the multiplication table of the group is given. The 
product of two symmetry operations, ba, is the symmetry operation that is equivalent 
to the successive application of a and b, performed in that order. 

2. ( ab)c — a(bc), i.e., the associative law holds. Since symmetry operations are 
usually motions or substitutions, this postulate is automatically satisfied. 

3. The group contains an identity element e, with the property 


ea = ae = a for every a 
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4. Each element has an inverse, denoted by a \ which is also an element of 
the group and has the property 

aa~ x = a~ x a = e 

All symmetry operations are reversible and thus have inverses. 

For example, rotations form a group in which the product ba of two elements 
b and a is defined as the single rotation that is equivalent to the two successive 
rotations a and b. By a rotation we mean the mapping of a physical system, or of a 
Cartesian coordinate frame, into a new physical system or coordinate frame obtain- 
able from the old system by actually rotating it. The term rotation is, however, not 
to be understood as the physical motion that takes the system from one orientation 
to another. The intervening orientations that a system assumes during the motion 
are ignored, and two rotations are identified as equal if they lead from the same 
initial configuration to the same final configuration regardless of the way in which 
the operation is carried out. In the rotation group, generally, ab A ba. For instance, 
two successive rotations by tt/2 — one about the x axis and the other about the 
y axis — do not lead to the same overall rotation when performed in reverse order. 
The operation “no rotation” is the identity element for the rotation group. 


Exercise 17.3. Use Eq. (16.62) to calculate the direction of the axis of rotation 
and the rotation angle for the compound rotation obtained by two successive rota- 
tions by 7t/2 about the x and y axes, respectively. Show that the result is different 
depending on the order in which the two rotations are performed. Convince yourself 
of the correctness of this result by rotating this book successively by 90° about two 
perpendicular axes. 


Exercise 17.4. Show that the three Pauli spin matrices, cr x , cr y , cr z , supple- 
mented by the identity 1 do not constitute a group under matrix multiplication, but 
that if these matrices are multiplied by ±1 and ±i a set of 16 matrices is obtained 
which meets the conditions for a group. Construct the group multiplication table. 


A symmetry operation a transforms the state I^P) of a system into a state ] 1 I r a ). 
It was shown in Section 17.1 that under quite general conditions this transformation 
may be assumed to be either unitary, |\P a ) = U a \ ''P), or antilinear. 

We assume here that the symmetry operations of interest belong to a group 
called a symmetry group of the system, which induces unitary linear transformations 
on the state vectors such that, if a and b are two elements of the group, 


U ba = U b U a 


(17.14) 


When (17.14) is translated into a matrix equation by introducing a complete set of 
basis vectors in the vector space of ['P), each element a of the group becomes 
associated with a matrix D(a) such that 


D(ab) = D(a)D(b) 


(17.15) 


That is, the matrices have the same multiplication table as the group elements to 
which they correspond. The set of matrices D(a ) is said to constitute a (matrix) 
representation of the group. Thus far in this book the term representation has been 
used mainly to describe a basis in an abstract vector space. In this chapter, the same 
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term will be used for the more specific group representation. The context usually 
establishes the intended meaning, and misunderstandings are unlikely to occur. 

A change of basis changes the matrices of a representation according to the 
relation 

D(a) = S~ l D(a)S (17.16) 


as discussed in Section 9.5. From a group theoretical point of view, two represen- 
tations that can be transformed into each other by a similarity transformation S are 
not really different, because the matrices D(a) obey the same multiplication rule 
(17.15) as the matrices D(a). The two representations are called equivalent , and a 
transformation (17.16) is known as an equivalence transformation. Two represen- 
tations are inequivalent if there is no transformation matrix S that takes one into the 
other. Since the operators U a were assumed to be unitary, the representation matrices 
are also unitary if the basis is orthonormal. In the following, all representations D(a) 
and all transformations S will be assumed in unitary form. 

By a judicious choice of the orthonormal basis, we can usually reduce a given 
group representation to block structure, such that all the matrices D(a ) of the rep- 
resentation simultaneously break up into direct sums of smaller matrices arrayed 
along the diagonal: 


D(a) = 


l Dfa) 0 0 

0 D 2 (a) 0 

0 0 DM 

\ • 




(17.17) 


We suppose that each matrix of the representation acquires the same kind of block 
structure. If n is the dimension of D, each block £> lt D 2 , ... is a matrix of dimension 
n u n 2 , ... , with n x + n 2 + . . . = n. It is then obvious that the matrices D 1 by 
themselves constitute an npdimensional representation. Similarly, D 2 gives an 
n 2 -dimensional representation, and D { is an « r dimensional representation. The orig- 
inal representation has thus been reduced to a number of simpler representations. 
The state vector (Hilbert) space has been similarly decomposed into a set of sub- 
spaces such that the unitary operators U a reduce to a direct sum 


u a = up © u™ © . . . 

with each set of operators satisfying the group property (17.14). 

If no basis can be found to reduce all D matrices of the representation simul- 
taneously to block structure, the representation is said to be irreducible. Otherwise 
it is called reducible. Apart from an unspecified equivalence transformation, the 
decomposition into irreducible representations is unique. (If two essentially different 
decompositions into irreducible subspaces were possible, the subspace formed by 
the intersection of two irreducible subspaces would itself have to be irreducible, 
contrary to the assumption.) There is therefore a definite sense in stating the irre- 
ducible representations (or irreps in technical jargon), which make up a given re- 
ducible representation. Some of the irreducible matrix representations may occur 
more than once. 

It is clearly sufficient to study all inequivalent irreducible representations of a 
group; all reducible representations are then built up from these. Group theory pro- 
vides the rules for constructing systematically all irreducible representations from 
the group multiplication table. Which of these are relevant in the analysis of a par- 



3 Symmetry Groups and Group Representations 


419 


ticular physical system depends on the structure of the state vector space of the 
system. 

The usefulness of the theory of group representations for quantum mechanics 
and notably the idea of irreducibility will come into sharper focus if the Schrodinger 
equation 7/|' v I f ) = £'| A P') is considered. A symmetry operation, applied to all eigen- 
states, must leave the Schrodinger equation invariant so that the energies and tran- 
sition amplitudes of the system are unaltered. The criterion for the invariance of the 
Schrodinger equation under the operations of the group is that the Hamiltonian com- 
mute with U a for every element a of the group: 


[H, UJ = 0 


(17.18) 


In Section 17.2 the same condition was obtained by applying symmetry requirements 
to the dynamical equations, and the connection between conservation laws and con- 
stants of the motion was established. By studying the symmetry group, which gives 
rise to these constants of the motion, we can learn much about the eigenvalue spec- 
trum of the Hamiltonian and the corresponding eigenfunctions. 

If E is an n-fold degenerate eigenvalue of the Hamiltonian, 

H\k) = E\k) (k = 1, 2, ..., n) (17.19) 

the degenerate eigenvectors | k) span a subspace and, owing to (17.18), 

HU a \k) = U a H\k ) = EU a \k) 

Thus, if |&) is an eigenvector of H corresponding to the eigenvalue E, then U a \k) is 
also an eigenvector and belongs to the same eigenvalue. Hence, it must be equal to 
a linear combination of the degenerate eigenvectors, 


n 

u a \k) = 2 I j)D Jk {a) (17.20) 

j=i 

where the D Jk (a ) are complex coefficients that depend on the group element. Re- 
peated application of symmetry operations gives 

n n n 

U b u a \k) = 2 U b \j)D jk (a) = 2 £ | t)D ej (b)Dj k (a) (17.21) 

j= 1 J= 1 €=1 


But we also have 


n 

U ba \k) = 2 I *)D ek (ba) (17.22) 

<= i 

By the assumption of (17.14), the left-hand sides of (17.21) and (17.22) are identical. 
Hence, comparing the right-hand sides, it follows that 


n 


D ek (ba) = ^ D ( j(b)D jk (a) 
J= i 


(17.23) 


This is the central equation of the theory. It shows that the coefficients D jk (a ) define 
a unitary representation of the symmetry group. If a vector lies entirely in the 
n-dimensional subspace spanned by the n degenerate eigenvectors of H, the opera- 
tions of the group transform this vector into another vector lying entirely in the same 
subspace, i.e., the symmetry operations leave the subspace invariant. 
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Since any representation D of the symmetry group can be characterized by the 
rreducible representations that it contains, it is possible to classify the stationary 
tates of a system by the irreducible representations to which the eigenvectors of H 
>elong. A partial determination of these eigenvectors can thereby be achieved. The 
abels of the irreducible representations to which an energy eigenvalue belongs are 
he quantum numbers of the stationary state. 

These considerations exhibit the mutual relationship between group theory and 
juantum mechanics: The eigenstates of (17.19) generate representations of the sym- 
netry group of the system described by H. Conversely, knowledge of the appropriate 
symmetry groups and their irreducible representations can aid considerably in solv- 
ng the Schrodinger equation for a complex system. If all symmetries of a system 
ire recognized, much can be inferred about the general character of the eigenvalue 
ipectrum and the nature of the eigenstates. The use of group theoretical methods, 
ipplied openly or covertly, is indispensable in the study of the structure and the 
ipectra of complex nuclei, atoms, molecules, and solids. The Schrodinger equation 
'or such many-body systems is almost hopelessly complicated, but its complexity 
:an be reduced and a great deal of information inferred from the various symmetry 
)roperties, such as translational and rotational symmetry, reflection symmetry, and 
lymmetry under exchange of identical particles. 

The observation of symmetric patterns and structures, as in crystals and mole- 
:ules, suggest the use of finite groups, i.e., transformation groups with a finite num- 
aer of elements. Often details about forces and interactions are unknown, or the 
heory is otherwise mathematically intractable, as in the case of strongly interacting 
fiementary particles (quantum chromodynamics). However, the dynamical laws are 
mderstood to be, subject to certain general symmetry principles, such as invariance 
mder rotations, Lorentz transformations, charge conjugation, interchange of iden- 
ical particles, “rotation” in isospin space, and, at least approximately, the operation 
)f the group SU(3) in a three-dimensional vector space corresponding to intrinsic 
iegrees of freedom. The irreducible representations of the groups which correspond 
o these symmetries provide us with the basic quantum numbers and selection rules 
: or the system, allowing classification of states, without requiring us to solve the 
complete dynamical theory. 

The utility of group representations in quantum mechanics is not restricted to 
systems whose Hamiltonian exhibits perfect invariance under certain symmetry 
ransformations. Although the symmetry may only be approximate and the degen- 
eracy of the energy eigenstates can be broken to a considerable degree, the states 
nay still form so-called multiplets, which under the action of the group operations 
ransform according to an irreducible representation. Thus, a set of these states can 
ie characterized by the labels, or “good quantum numbers,” of the representation 
vith which they are identified or to which they are said to “belong.” An under- 
standing of the pertinent symmetry groups for a given system not only offers con- 
siderable physical insight, but as we will see in Sections 17.7 and 17.8, can also 
simplify the technical calculation of important matrix elements in theory. For ex- 
ample, the orbital angular momentum operator L 2 , whose eigenvalues €(€ 4- l)h 2 
vill be seen in the next section to label the irreducible representations of the group 
if rotations of the position coordinates alone, commutes with the Hamiltonian of an 
fiectron in an atom that is exposed to an external uniform magnetic field (but not 
an electric field) and, also with the spin-orbit interaction. Therefore, the quantum 
lumbers € characterize multiplets in atomic spectra. 
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The continuous groups that are of particular interest in quantum mechanics are 
various groups of linear transformations, conveniently expressible in terms of ma- 
trices, which — like the rotations in Section 16.3 — can be generated by integration 
from infinitesimal transformations. The elements of such Lie groups are specified 
by a minimal set of independent, continuously variable real parameters — three, in 
the case of ordinary rotations — and the corresponding generators. The algebra of 
the generators, interpreted as (Hermitian) matrices or operators with their charac- 
teristic commutation relations, is the mathematical tool for obtaining the irreducible 
representations of these groups. An important category of continuous groups are the 
so-called semi-simple Lie groups, which are of particular physical relevance and 
also have attractive mathematical properties (analogous to the richness of the theory 
of analytic functions). Examples of important semi-simple Lie groups are the 
n-dimensional groups 0(n) of the real orthogonal matrices; their subgroups SO(n ) 
composed of those matrices that have a determinant equal to + 1 (with the letter S 
standing for special)-, and the special unitary groups SU(n). In Chapter 16 we saw 
that the rotation group R( 3), which is isomorphic to 0(3), is intimately related to 
SU(2). This connection will be developed further in Section 17.4. 

Exercise 17.5. An n-dimensional proper real orthogonal matrix SO(n), i.e., a 
matrix whose inverse equals its transpose and which has determinant equal to unity, 
can be expressed as exp(X), where X is a real-valued skew- symmetric matrix. Show 
that the group of special orthogonal matrices SO(n ) has n(n — l)/2 independent real 
parameters. (Compare Exercise 16.3.) Similarly, show that the group SU(n) has 
n 2 — 1 real parameters. 

4 . The Representations of the Rotation Group. The representations of the ro- 
tation group R( 3), which is our prime example, are generated from the rotation 
operator (17.7), 

Ur = ex p(“^ ‘ ( 17 - 4 * * 7 ) 

The rotations in real three-dimensional space are characterized by three independent 
parameters, and correspondingly there are three Hermitian generators, J x , J y , J z , of 
infinitesimal rotations. They satisfy the standard commutation relations for the com- 
ponents of angular momentum, (17.8). The eigenvectors of one and only one of 
them, usually chosen to be J z , can serve as basis vectors of the representation, thus 
diagonalizing J z . In other Lie groups, the maximum number of generators that com- 
mute with each other and can be simultaneously diagonalized is usually greater than 
one. This number is called the rank of the group. The groups 0(4) and 517(4) both 
have rank two. 

The central theorem on group representations is Schur’s (second) Lemma: 

If the matrices D(a) form an irreducible representation of a group and if a 
matrix M commutes with all D(a), 

[M, D(a)] = 0 for every a (17.24) 

then M is a multiple of the identity matrix. 

This result encourages us to look for a normal operator C which commutes with all 
the generators, and thus with every element, of the given symmetry group. The aim 
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> to find an operator C whose eigenvalues can be used to characterize and classify 
le irreducible representations of the group. If the operator has distinct eigenvalues 
x, c 2 , . . . , it can by a suitable choice of the basis vectors be represented in diagonal 
3rm as 

! Cl l 0 0 

„ 0 c 2 l 0 

0 0 . 

0 0 0 

/here the identity matrices have dimensions corresponding to the multiplicities (de- 
eneracies) of the eigenvalues c u c 2 , , respectively. In this basis, all matrices 
epresenting the group elements are reduced to block structure as in (17.17). 

Exercise 17.6. Show that if D(a) commutes with C, the matrix elements of 
)(a) which connect basis vectors that belong to two distinct eigenvalues of C (e.g., 
i + c 2 ) are zero. 

If the reduction to block structure produces irreducible representations of the 
;roup, the eigenvalues c u c 2 , ... , of the operator C are convenient numbers ( quan - 
um numbers ) which may serve as labels classifying the irreducible representations, 
lince, depending on the nature of the vector (Hilbert) space of the physical system 
nder consideration, any particular irreducible representation may appear repeatedly, 
dditional labels a are usually needed to identify the basis vectors completely and 
miquely. 

In general, more than one operator C is needed so that the eigenvalues will 
irovide a complete characterization of all irreducible representations of a symmetry 
;roup. For the important class of the semi-simple Lie groups , the rank of the group 
s equal to the number of mutually commuting independent Casimir operators suf- 
icient to characterize all irreducible representations. For the rotation group R( 3) of 
ank one, the Casimir operator, which commutes with every component of J, is 
hosen to be the familiar square magnitude of the angular momentum operator, 

J 2 = J 2 X + J 2 y + J\ = ^ (/+/_ + J-J+) + J\ (17.26) 

vith eigenvalues j(j + l)h 2 . The nonnegative integral or half- integral angular mo- 
nentum quantum number j fully characterizes the irreducible representations of the 
otation group in three dimensions. (In four dimensions, two Casimir operators and 
heir quantum numbers are needed.) As is customary, we choose the common ei- 
genvectors of J 2 and J z as our basis vectors and denote them by j ajm ). Since the 
piantum numbers a are entirely unaffected by rotation, they may be omitted in some 
>f the subsequent formulas, but they will be reintroduced whenever they are needed. 

From Section 11.2 we copy the fundamental equations: 

J z \j m) = m h\j m ) 

J+\j m) = V(; ~ m)(j + m + l)h\j m + 1) (17.27) 

/_ |y m) = V(j + m)(j — m + l)h | j m - 1) 

The vector space of the system at hand thus decomposes into a number of disjoint 
2 j + l)-dimensional subspaces whose intersection is the null vector and which are 


o\ 

0 

0 

■I 


(17.25) 
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invariant under rotation. An arbitrary rotation is represented in one of the subspaces 
by the matrix 


Dm'miR) = (jm'\U R \jm) = (jm'\ exp I — ^ n • J<j>\\jm) 


(17.28) 


Owing to (9.63) and (9.64), the representation matrix can be written as 

D°\R) = ex P^- “ n • J (17.29) 

where now J stands for the matrix whose elements are ( jm' \ i\jm). 

The simplicity of (17.28) and (17.29) is deceptive, for the components of J 
other than J z are represented by nondiagonal matrices, and the detailed dependence 
of the matrix elements of D U) (R) on the quantum numbers and on the rotation pa- 
rameters n and (f> is quite complicated. For small values of j, we can make use of 
the formula (10.31) to construct the rotation matrices in terms of the first 2 j powers 
of the matrix n • J : 


j j Lfi 

D (J \R) = 2 e-*+ El — 

k=-j k’ = -j (K 

- n J 

— k)h 

k’ =tk 



(17.30) 


Exercise 17.7. Using (17.30) and the explicit form of the angular momentum 
matrices, work out the rotation matrices for j = 0, 1/2, and 1. 


Exceptional simplification occurs for the subgroup of two-dimensional rotations 
about the “axis of quantization,” the z axis if J z is chosen to be diagonal. For such 
special rotations 

D$M) = e- im ^8 m ^ m (17.31) 

The representation matrices are also simple for infinitesimal rotations, i.e., when 
<f> « 1. In this case, by expanding (17.28) to first order in e = cf) n, and applying 
(17.27), we get 


D% m {en) = 8 m ^ m - l —--- Sy V(; - m)(j + m + 1) 


m+ 1 


(17.32) 


ie x Sy 


V(y + m)(j - m + 1) - ie z m8 m ^ n 


A further diagonalization of all rotation matrices is clearly impossible, since diag- 
onalizing, for instance, J x would necessarily undo the diagonalization of J z . Hence, 
for a given value of j, the irreducibility of the representations just obtained is ver- 
ified. All continuous unitary irreducible representations of the rotation group are 
obtained by allowing j to assume the values j = 0, 1/2, 1, 3/2, .... 

Again we must comment on the half-integral values of j. From the point of view 
of infinitesimal rotations, these are on the same footing as the integral values, as is 
confirmed by (17.32). However, when finite rotations are considered, an important 
distinction arises, for upon rotation about a fixed angle by (f> = 2i r, (17.31) shows 
that the matrix D 0) (27r) = — 1 is obtained. Yet, the system has been restored to its 
original configuration, and the resulting rotation is equivalent to the operation “no 
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rotation at all.” Hence, for half-integral values of j, a double-valued representation 
of the group is produced: to any rotation R correspond two distinct matrices differing 
by a sign. By the strictest definition, these Z) y) matrices do not constitute a repre- 
sentation of the rotation group. Instead, as we saw in Chapter 16, they represent the 
unitary unimodular group in two dimensions, SU( 2). This group is said to be the 
universal covering group of the rotation group. With this understanding, we can 
safely regard them as double-valued representations of the rotation group. They 
appear naturally in the reduction of the matrix representing the rotation operator U R . 
Owing to the usual phase ambiguities of quantum mechanics, they have their place 
in the theory, provided that U R contains only double-valued representations. 

Exercise 17.8. Show that the representation matrices D (1/2 \R ) are equivalent 
to the matrices U R in Eq. (16.62). 


The general expression for the representation matrices D(R) is most conven- 
iently obtained if the rotation is parametrized in terms of the three Euler angles, a, 
f3, y, rather than in terms of n and $. As indicated in Figure 17.1, a: is the angle of 
a rotation about the z axis of the fixed frame-of-reference. This rotation carries the 
y axis into the y" direction, and the second rotation by the angle /3 takes place around 
this nodal line. This rotation carries the z axis into the z' direction, defined by the 
azimuthal angle a and the polar angle /3. The final rotation is by the angle y about 
the new z' axis. Hence, the complete rotation operator is 


U R = expl -- yz' 


J exp -- (3y" ■ J exp -- az ■ J 


h 


h 


(17.33) 



Figure 17.1. Euler angles a (rotation about the z axis), j 8 (rotation about the y" axis or 
nodal line), and y (rotation about the z' axis), transforming xyz into x'y’z' ■ All 
intermediate axes are shown as dashed lines. The Euler angles a and /S are the spherical 
polar coordinates of the z' axis with respect to the xyz coordinate system (see Figure 11.3). 
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It is preferable to transform the right-hand side into a product of rotations about the 
fixed axes x, y, z. By inspection of Figure 17.1, it is easy to see that 




(17.34) 


and 


e -<jih) y r-3 — e -m)Py"J e -«/Vyz-J e <.MW"J 

Substitution of (17.34) and (17.35) into (17.33) yields the simple result 

JJ — — g ~ 0!ti)aJ Z g ~ (‘/h)PJyg ~ (i!h)yJ z 


(17.35) 

(17.36) 


The matrix elements of the irreducible representation characterized by j are then 


13, y) 


= {jm' | e~ (iimJ y e~ m ' )yJ *\jm) 

= e~ iam ' e~ iym {jm ’ [ e - <i,h)l3J y \ jm) 


= e~ iam 'e~ iym (jrn' | exp 


J + - 7_ 
2 h 


I jm) 


(17.37) 


We leave the details of working out explicit formulas for the representation matrices 
for general values of j to the myriad of specialized treatises on angular momentum 4 
and the rotation group, but we note two particularly useful symmetry relations for 
the matrix elements: 


D$ m *(a, (3, y) = £>&«(-% ~j8, - a) (17.38) 

(3, y) - (- 1)— 'D« m% _ m (a, (3, y) (17.39) 


Exercise 1 7.9. Using the unitary property of U R , show that the inverse rotation 
i? _1 is generated either by first rotating by —y around z,', then by — f3 around y', 
and finally by — a around z, or equivalently first rotating by — a around z, then by 
— j8 around y, and finally by — y again around z. Prove the symmetry relation (17.38). 
[The relation (17.39) is related to time reversal symmetry and can be proved at the 
end of the chapter.] 

Exercise 17.10. Work out the rotation matrices in terms of Euler angles for 
j = 0, 1/2, and 1, and compare the results with the results of Exercise 17.7. 


If j is integral, j = t, we can establish the connection between the rotation 
matrices and the eigenfunctions of orbital angular momentum. Expressed in terms 
of a (single particle) coordinate basis, the eigenvectors of J Z (L Z ) and J 2 (L 2 ) can be 
chosen to be the spherical harmonics: 

(r|€m) = 17(0, cp) (17.40) 

A rotation transforms an eigenvector | im) of L z into an eigenvector | im)' of L z >, 
with the same values of i and m, z' being the new axis obtained by the rotation 
from the z axis. Hence, 


i M' = U R \tm) - 2 | Zm')D% m {R) 

m' 


(17.41) 


4 Rose (1957), Brink and Satchler (1968), Biedenharn and Louck (1981), Thompson (1994). See 
also Feagin (1993), Chapters 16-19, for a computer-oriented approach. 
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and, if this is multiplied on the left by | r), 


e 

m' = — e 


(17.42) 


Here 9, <p, and 9', <p' are coordinates of the same physical point. 

Equation (17.42) can be inverted by using the unitary property of the represen- 
tation: 


t 

Y?(9, <p) = 2 D%l*(R)Y?'(9', <p') (17.43) 

m'=-€ 

Consider in particular a point on the new z' axis, 9' — 0. Using (11.98), we find 
the connection between the D matrices and spherical harmonics: 


D<ti*(R) = 


47 T 


2 € + 1 


1703 , a) 


(17.44) 


where (3, a are the spherical polar coordinates of the new z 1 axis in the old coordinate 
system (see Figure 17.1). If this is substituted into (17.42) for m = 0, we get 

I 4 * 

Y%9’, *') = JWTT 7 S 17(3. 9)17*03, «) (17.45) 

Y Zt + 1 m=-t 

By (11.99), we finally retrieve the addition theorem for spherical harmonics, 
( 11 . 100 ): 

P t ( cos 9’) = S Y?*Q 3, a)Y?(9, <p) (17.46) 

Zt "T 1 m=-t 


5. The Addition of Angular Momenta. If two distincf physical systems or two 
distinct sets of dynamical variables of one system, which are described in two dif- 
ferent vector spaces, are merged, the states of the composite system are vectors in 
the direct product space of the two previously separate vector spaces. The mathe- 
matical procedure was outlined in Section 15.4. A common rotation of the composite 
system is represented by the direct product of the rotation operators for each sub- 
system and may, as an application of Eq. (15.82), be written as 



- n • J x 0 1 ® expl — — n • J 2 4> 


exp 


-jh ■ (3 1 <g> 1 + 1 ® J 2 )0 
n 

(17." 


7 ) 


The operator 


J - Ji ® 1 + 1 ® J 2 


often written more simply, if less accurately, as 

J = Ji + J 2 


(17.48) 

(17.49) 


is the total angular momentum of the composite system. We sometimes say that J x 
and J 2 are coupled to the resultant J. 
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Exercise 17.11. Prove Eq. (17.47) and show that Ji (x) 1 commutes with 

/ (8) J 2 


An important example of adding commuting angular momentum operators was 
already encountered in Section 16.4 and again touched upon in Section 17.3, where 
L and S were combined into J. The present section is devoted to the formal and 
general solution of adding any two commuting angular momenta. 

Since each component of J! commutes with each component of J 2 , which sep- 
arately satisfy the usual commutation relations for angular momentum, the total 
angular momentum J also satisfies the angular momentum commutation relations: 

Ui, Jj] = ihe ijk J k (17.50) 

The problem of the addition of two angular momenta consists of obtaining the 
eigenvalues of J z and J 2 and their eigenvectors in terms of the direct product of the 
eigenvectors of J lz , J\ and of J 2z , Jl The normalized simultaneous eigenvectors of 
the four operators J?, j\, J lz , J 2z , can be symbolized by the direct product kets, 


= IjiWj) (g) | j 2 m 2 ) = \j x m x )\j 2 m 2 ) 


(17.51) 


rhese constitute a basis in the product space. From this basis we propose to construct 
:he eigenvectors of J z , J 2 which form a new basis. 

The operators J 2 and Jl commute with every component of J. Hence, 

U z , J?] = U z , Jl] = [J 2 , Jl] = [J 2 , Jl] = 0 


ind the eigenvectors of J z and J 2 can be required to be simultaneously eigenvectors 
if Jl and Jl as well. (But J 2 does not commute with J lz or J 2z \) In the subspace of 
:he simultaneous eigenvectors of Jl and Jl with eigenvalues j x and j 2 , respectively, 
we can thus write the transformation equation 


\j\hjm) = S \jij 2 in 1 m 2 )(j i j 2 m ] m 2 \j l j 2 jm) 
m \ m 2 


(17.52) 


connecting the two sets of normalized eigenvectors. The summation needs to be 
carried out only over the two eigenvalues m x and m 2 , for j\ and j 2 can be assumed 
;o have fixed values. Thus, the problem of adding angular momenta is the problem 
if determining the transformation coefficients 

rhese elements of the transformation matrix are called Clebsch-Gordan or Wigner 
zoefficients or vector-addition or vector-coupling coefficients. When no confusion is 
ikely, we simplify the notation and write 

(hh™\m 2 \j\j 2 jm) 55 {m x m 2 \jm) 

or it is understood that j x and j 2 are the maximum values of m x and m 2 . 5 Commas 
will be inserted between quantum numbers in a Clebsch-Gordan coefficient only if 
:hey are needed for clarity. As an abbreviation, we will refer to the Clebsch-Gordan 
coefficients as C-G coefficients. 


s For a complete discussion of Wigner coefficients and the many notations that are in use for them 
ind their variants, see Biedenharn and Louck (1981). 
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If the operator J z — J Zi 4- J Z2 is applied to (17.52) and if the eigenvalue con- 
ditions 


Jz \j\jij™) = mh\j x j 2 jm) 

(J Z] + Jz 2 )\jij 2 m im 2 ) = (m l 4- m 2 )h\j l j 2 m 1 m 2 ) 

are used, it can immediately be concluded that the m quantum numbers are subject 
to the selection rule : 

\jj 2 jm) = 0 if m # m 1 + m 2 (17.53) 

Applying 7_ and J+ to (17.52), we obtain the following recursion relations for the 
C-G coefficients: 

V(y ± m)(j T m + l)(m 1 m 2 |;, m + 1) - V/i + nhXTi ± m 1 + l)(m,±l, m 2 \jm) 

+ V0‘ 2 + m 2 )(j 2 ± m 2 + 1 )(m u m 2 ±l\jm) (17.54) 

Exercise 17.12. Derive the recursion relation (17.54), as indicated. 


To appreciate the usefulness of these equations, we set m x = j\ and m — j in 
(17.54), using the upper signs. Owing to the selection rule (17.53), for nontrivial 
results m 2 can have only the value m 2 = j — ji — 1, and we find that 

V2 / (ji, j - j x - 1 \j, j - 1> = V(j 2 - j + Jl + l)(J 2 + j ~ jl) On j ~ jl\jj) 

(17.55) 

Hence, if On j ~ j 1 \ j j > is known, On j ~j 1 - 1 \j, j ~ 0 can be determined. From 
these two C-G coefficients we can then calculate Oi ~ 1, j — ji |;, j — 1) by again 
applying (17.54), but this time using the lower signs and setting m x = j u 
m 2 = j ~ j n m = j — 1 . 

Continuing in this manner, we can use the recursion relations (17.54) to give 
for fixed values of j u j 2 , and j all the C-G coefficients in terms of just one of them, 
namely, 

(jJiju j ~ ji | jijij j > (17.56) 

The absolute value of this coefficient is determined by normalization (see below). 
From the selection rule (17.53) we deduce that the C-G coefficient (17.56) is dif- 
ferent from zero only if 

“A ^j~ ji ~ 72 

this being the range of values of m 2 . Hence,,/' is restricted to the range 


ji ~ 72 ^ j ^ jl + 72 

But since y - ! and j 2 are on a symmetric footing, we could equally well have expressed 
all C-G coefficients in terms of (jj 2 , j — j 2 , j 2 \jJ2j j ) and then have concluded that 
j must be resticted by the condition 

72 ~ ji ^ j ^ ji + 72 

The last two inequalities together show that the three angular momentum quantum 
numbers must satisfy the so-called triangular condition 


\ji ~ ji | — 7 ^ ji + 72 


(17.57) 
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i.e., the three nonnegative integer or half-integers j u j 2 , j must be such that they 
could constitute the three sides of a triangle. Since m — m x + m 2 ranges between 
—j and +j, it follows that j can assume only the values 


j = h + hi II + k ~ 1. • • • . l7i ~ A 1 + l,L/i ~ h 


(17.58) 


Hence, either all three quantum numbers j\, j 2 , j are integers, or two of them are 
half-integral and one is an integer. 


Exercise 17.13. Show that a symmetry exists among the three quantum num- 
bers ji, j 2 , j, and that in addition to (17.57) they satisfy the equivalent relations 

1 j ~ h I ^ j\ — j + h and \j - j t | < j 2 < j + j 1 

We observe that for fixed values of j\ and j 2 (17.52) gives a complete new basis 
in the (2j 1 + 1)(2 j 2 + l)-dimensional vector space spanned by the kets \j\j 2 mim 2 ). 
Indeed, the new kets \jij 2 j m), with j given by the allowed values (17.58), are again 
(2/, + 1 )( 27 2 + 1) in number, and being eigenkets of Hermitian operators they are 
also orthogonal. Since the old and new bases are both normalized to unity, the C-G 
coefficients must constitute a unitary matrix. Furthermore, it is clear from the re- 
cursion relations for the C-G coefficients that all of them are real numbers if one of 
them, say {jij 2 ,ji,j — j\ \j 1 j 2 j j), is chosen real. If this is done, the C-G coefficients 
satisfy the condition 

2 (jij 2 tn l m 2 \j l j 2 j m)(j } j 2 m- l m 2 \j : j 2 j > m l ) = 8 mm .8 jr (17.59) 

mj m 2 

or inversely 

2 (jih m \ m 2 \jij2j m )(jij 2 m i m 2 \jij 2 j m ) = S mimi S m2ra , (17.60) 

jm 

The double sums in these two equations can be substantially simplified by the use 
of the selection rule (17.53). Condition (17.59) in conjunction with the recursion 
relations determines all C-G coefficients except for a sign. It is conventional to 
choose the sign by demanding that the C-G coefficient (jij 2 , j 1 , j — 7i \ j 1 j 2 j j ) be 
real and positive. Extensive numerical tables and computer codes are available for 
C-G coefficients. 6 

Without proof, we note the most useful symmetry relations for C-G coefficients: 

(j\hmym 2 \j^j 2 j m) = (-l) j ~ jl ~ h (j 2 jim 2 m 1 \jjjJ m) = {j 2 j u -m 2 , ~m { \j 2 jj, -m) 

(17.61) 


Exercise 17.14. Calculate the C-G coefficients needed to couple the two an- 
gular momenta j x = 3/2 and j 2 — 1 to the possible j values, and express the coupled 
states | jf 2 j m ) in terms of the uncoupled states \jij 2 niin 2 ). 


Exercise 17.15. Verify the values of two special C-G coefficients: 


(jW\jijj) 



and (j2j0\j2jj) 


j( 2j ~ 1 ) 

(j + 1)(2 j + 3) 


(17.62) 


6 See Thompson (1994) for references. 
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Determine the value of the trivial C-G coefficient ( j0m0\j0jm ). How does it depend 
on the value of ml 


When we deal with spin one-half particles, the C-G coefficients for j 2 = 1/2 
are frequently needed. From the recursion relations (17.54), the normalization con- 
dition (17.59), and the convention that { ji , 1/2, j u j — ji\ju 1/2, j, j) shall be real 
and positive, we obtain the values 


(ju 1/2, m - 1/2, 1/2 1 y'j 1/2, ji ± 1/2, m) = ± 


(ju 1/2, m + 1/2, - 1/2 Ijj, 1/2, A ± 1/2, m) = 

since the allowed j values are j = j l ± 1/2. 

Exercise 17.16. Work out the results (17.63) from the recursion relations for 
C-G coefficients and the normalization and standard phase conditions. 


Ij i -+- m + 1/2 
I' 2Ji + 1 


j i ± m + 1/2 
I 2 ji + 1 


If the coordinate representation is used for the eigenstates of orbital angular 
momentum L and if, as usual, the eigenstates of S z , 


a — 



and 



are chosen as a basis to represent the spin, the total angular momentum J = L + S 
and its eigenvectors may be represented in the direct product spin (x) coordinate 
basis. We will denote the common eigenstates of J z and J 2 by °D J e m . Using the C-G 
coefficients (17.63) with j x = Z, we have 


qyjm — eij(- 1/2, m 


1 (±Vt ±m + \ F“- 1/2 \ 
Viz + 1 V Vz + m + \ r? +m ) 


(17.64) 


for the eigenstates with j = Z ± 1/2. These coupled eigenstates can be expected to 
play an important role in the quantum mechanics of single-electron and single- 
nucleon systems. 

Exercise 17.17. Apply 

J 2 = L 2 + S 2 + 2L • S = L 2 + S 2 + L + S- + L-S + + 2 L Z S Z (17.65) 

to Eq. (17.64) and verify that is an eigenstate of J 2 . 

The formulas (17.63) are also useful when the commuting spins of two particles 
are added, or coupled, to give the total spin 

S = S 1 + S 2 (17.66) 

If the two particles have spin one-half, the direct product spin space of the complete 
system is four-dimensional, and a particular basis is spanned by the uncoupled ei- 
genvectors of S lz and S 2? : 

a l ® a 2 = a l&2> a l ® P2 ~ a 1^2, Pi ® a 2 = fil a 2> Pi ® P2 ~ PlPl 

(17.67) 
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By letting j 2 = 1/2 in the expressions (17.63), we obtain the appropriate C-G co- 
efficients, allowing us to write the coupled eigenstates of S z and S 2 in the form: 

(17.68) 

and 



If we write the eigenvalues of S 2 in the usual form h 2 S(S + 1), the total spin quantum 
numbers S — 0 and S — 1 characterize two irreducible representations of R( 3) and 
the eigenstates of S 2 . The state (17.68) corresponds to S = 0 and is called a singlet 
state. The three states (17.69) correspond to S — 1 and are said to be the members 
of a triplet ; they are, successively, eigenstates of S z with eigenvalues +h, 0, and 
—ft. 

Exercise 17.18. Starting with the uncoupled basis (17.67), work out the 4X4 
matrices S z and S 2 , and show by explicit diagonalization that the singlet and triplet 
states are the eigenvectors with the appropriate eigenvalues. 

6. The Clebsch-Gordan Series. The direct products of the matrices of two rep- 
resentations of a group again constitute a representation of the same group. The 
latter is usually reducible even if the original two representations are irreducible. 
The product representation can then be reduced, or decomposed, giving us generally 
new irreducible representations. 

For the rotation group in three-space, Eqs. (17.47) and (17.48) show that the 
problem of reducing the direct product representation D <Jl> (R) (x) D (j2) (R) is inti- 
mately related to the problem of adding two commuting angular momenta, 
J = Jj + J 2 . Under rotations, the state vector in the (2 j 1 -I- 1) X (2 j 2 + l)-dimen- 
sional direct product space transforms according to D Ul) (x) D <j2 \ and the reduction 
consists of determining the invariant subspaces contained in this space. Since the 
irreducible representations are characterized by the eigenvalues of J 2 , the integral 
or half-integral quantum number j, which can assume values between |jj — j 2 \ and 
j i + j 2 in integral steps, labels the several representations into which the direct 
product representation decomposes. It follows from the last section that in the re- 
duction each of these irreducible representations of the rotation group appears once 
and only once. Formally, this fact is expressed by the equivalence 

D (Ji+j2> 0 0 0 

0 D <Jl+h ~ l) 0 0 

0 0.0 
0 0 0 D\ Jl ~ h 

In particle physics, it is customary to identify the irreducible representations by 
their dimensionality. For the rotation group, which has (except for equivalence trans- 
formations produced by changing bases) precisely one irreducible representation of 
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each dimension, the boldface number 2j + 1 thus characterizes an irreducible rep- 
resentation unambiguously. The relation (17.70) can then be concisely, if symboli- 
cally, written as 

(2j. + 1) ® (2j, + 1) = [2(ji + j 2 ) + 1] © [2(j, + i - 1) + 1] © , |7 ... 

...@[2[j, -j 2 | + 1] 1 ’ 

Exercise 17.19. Show by explicit counting that the matrices on both sides of 
(17.70) have the same dimension and that (17.71) is correct. Verify this for a few 
examples by letting j x and j 2 assume some simple values, e.g., 0, 1/2, 1, 3/2, or 2. 


The Clebsch-Gordan coefficients furnish the unitary transformation from the 
basis \jij 2 m im 2 ), in which the matrices and D Ul) and D <Jl) represent rotations, to the 
basis \jijzjm), in which D U) represents rotations. Hence, noting also that the C-G 
coefficients are real, we may write the equivalence (17.70) immediately and explic- 
itly as 

i\ +h 

Dmf m ,(R)Dmf m2 (R) = X E (hhm { m 2 \jihjm){j l j 2 m[m' 2 \jij 2 jm')D% m (R) 
j= IA ~h I mm ' 

(17.72) 


This expansion is called the Clebsch-Gordan series. As a useful application of this 
equation, we call on the identity (17.44) to obtain for j\ — € 1 , j 2 = j = € and 
appropriate choices for the m’s: 


• ^ (2€j + 1)(2€ 2 + 1) 

Y7,<e, m rt = s / l 4 - ^ + 1 r ~ x 


(f 1 f 2 00|f 1 f2^0)(€ 1 € 2 miW 2 | Wt + m 2 )Y™ l+m2 (d, <p ) 

(17.73) 


From this we find the value of the frequently used integral, 


I 


(2€ t + 1)(2€ 2 + 1) 

[Y%(d, <P)]*Y% (d, <p)Y%(6, <p)dn = J 4^3 + 1) X 

(£i € 2 00 1 'f 1 f 2 f’ 3 0}(f’ 1 £ 2 m x m 2 1 € x £ 2 £ 3 m 3 ) 


(17.74) 


Exercise 17.20. By use of the unitary condition for the D matrices and the 
orthogonality of the C-G coefficients, derive from (17.72) the linear homogeneous 
relation for the C-G coefficients: 

S (hh m '\ m 2 \khh^D ( ^ m ^K) = 2 D mf m ( R )(jij2m l rn 2 \jij 2 hm)D^ mi *(R) 

mo m,m 

(17.75) 

Show that for infinitesimal rotations this equation is identical with the recursion 
relations (17.54) for the C-G relations. 

7. Tensor Operators and the Wigner-Eckart Theorem. So far this chapter has 
been concerned mainly with the behavior under rotation of state vectors and wave 
functions. This section focuses on the rotational transformation properties of the 
operators of quantum mechanics and generalizes the concepts introduced in Sections 
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11.1 and 16.3. The operators corresponding to various physical quantities will be 
characterized by their behavior under rotation as scalars, vectors, and tensors. From 
a knowledge of this behavior alone, much information will be inferred about the 
structure of the matrix elements. Such information is useful in many applications of 
quantum mechanics. 

Let us suppose that a rotation R takes a state vector ^ by a unitary transfor- 
mation U R into the state vector = U R ty. A vector operator A for the system is 
defined as an operator whose expectation value (A) is a vector that rotates together 
with the physical system. It is convenient to consider the rotationally invariant scalar 
product (A) • e where e denotes an arbitrary vector that also rotates with the system 
into the vector e ' . Then we require that 

<^|A|^> • e = OTlAl'T) • e' (17.76) 

Since ''P is an arbitrary state, we infer the condition 

U r AUr ■ e = A • e' (17.77) 

Under a right-handed rotation parametrized by the axis n and the angle of rotation 
4>, we can, according to (16.36), express the relation between e and e' as 

§' = [1 + (1 — cos (f>) n X (n X) + sin n X] e (17.78) 

or in matrix form, using a Cartesian basis (x, y, z), 

3 

e’t = X Rijej (17.79) 

j= i 

where 

( (1 — n x ) cos 4> + nl n x n y ( 1 — cos <f)) — n z sin 4> n x n z( 1 — cos + n y s i n 4> 
n y n x ( 1 — cos <t>) + n z sin 4> (1 — n y ) cos <j> + n y n y n z ( 1 — cos <f>) — n x sin <f> 

n z n x ( 1 — cos (j > ) — n y sin n z n y { 1 — cos <f>) + n x sin (j> (1 — n z ) cos <p + n\ 

( 17 . 80 ) 

If (17.79) is substituted in (17.77), the condition for the components of a vector 
operator is seen to be 

UrAjUI = X AAj (17.81) 

i 

Exercise 17.21. To first order, work out the rotation matrix R for an infini- 
tesimal rotation characterized by the vector e = en, and check that this agrees with 
the result from formula (16.38). 

Exercise 17.22. By carrying out two successive rotations, verify that (17.81) 
is consistent with the properties of the rotation group and its representation by the 
R matrices. 

Again it is helpful to confine the discussion to infinitesimal rotations, since finite 
rotations can be generated by integration (exponentiation). If (17.78) is applied to 
an infinitesimal rotation by an angle 4> — s, and substituted in (17.77), we find that 

7 + ia-j)-A-axA 
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or 


[A, ft • J] = ihh X A 


(17.82) 


in agreement with the commutation relations (11.19) and (16.42). A vector operator 
A must satisfy (17.82) for arbitrary ft. Of course, J itself is a vector operator and 
satisfies (17.82) for A = J. 

Whether or not a vector A constitutes a vector operator for a system depends 
on the definition of the physical system and the structure of its angular momentum 
operator J. As an example, consider the case where the coordinate operators x, y, z 
provide a complete description of the dynamical system, such as a particle without 
spin. In this case, we can identify J with L = r X p. The quantities r, p, L are all 
vector operators for this system, as can be verified by using the fundamental com- 
mutation relations between r and p and checking the commutation relations (17.82). 
On the other hand, an external electric field E acting on the system does not in 
general make up a vector operator for this system — even though E is a vector. 
Neither condition (17.81) nor condition (17.82) is satisfied by such a field, as long 
as E is external to the system and not subject to rotation together with it. But E 
would become a proper vector operator if the system were enlarged so as to include 
the electromagnetic field in the dynamical description. This would result in a much 
more complicated operator J, and the commutation relations (17.82) would then be 
satisfied by E. The theory will be generalized along these lines in Chapter 23. 

If the system is a particle with spin, S becomes a vector operator provided that 
J is now taken to be L + S; that is, the spin wave function must be rotated together 
with the space wave function. 

A vector is a tensor of rank one. The left-hand side of (17.81), A/ = U r AjU r , 
can according to (9.128) be interpreted as the operator into which Aj is transformed 
by the rotation. If A } maps a state 'E into d>, A/ = U r AjU r maps U R ty into U R <£>. 
The Cartesian components of A' are linear combinations of the components of A, 
and the expansion coefficients R u are the matrix elements of the fundamental 
three-dimensional representation of the rotation group in a basis spanned by the 
Cartesian unit vectors. 

We generalize these ideas and define a tensor operator as a set of operators 
which, in analogy to (17.81), under rotation induce a representation of the rotation 
group. Generally, such a representation is reducible. (In particular, the usual Car- 
tesian tensors induce reducible representations if the rank of the tensor exceeds 1.) 
The reduction decomposes the space of the tensors into rotationally invariant sub- 
spaces, suggesting that we define as a building block of the theory the so-called 
irreducible spherical tensor operator of nonnegative integer rank k. This is a set of 
2k + 1 operators 7j?, with q = —k, —k+ 1 , k - 1, k, which satisfy the trans- 
formation law 


k 

U R Tt‘Ul = 2 n>D ( q %(R) 

q' = -k 


(17.83) 


As a special case (for k = 1), we may apply (17.83) to the (irreducible) vector 
operator r and compare the formula with (17.81). If we apply the rotational trans- 
formation of spherical harmonics, as given by Eq. (17.42), to the case k = 1 and 
employ the explicit form of the spherical harmonics Y™, from (11.82), we find 


x' + iy' 


x — ly 


x + iy x — iy 

. rz * 2 , _ rr 


D (l \R) (17.84) 
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where the representation matrix D (l) {R ) is the matrix (17.29) forj = 1. Hence, the 
Cartesian components of any vector operator A define an irreducible spherical tensor 
operator 7) of rank 1 by the relations 


Tt 1 = ~ 


A x + iA y 


V2 


n = a z . 


, A x - iA v 

rr 1 = 


V2 


(17.85) 


Exercise 17.23. Construct the 3X3 matrix for the transformation (17.85) and 
show that it is unitary. How does this matrix transform the rotation matrix R into 
D ( 1 ) (i?)? 


Armed with the definition of an irreducible tensor operator, we can now for- 
mulate and prove the Wigner-Eckart theorem, which answers the following question: 
If Tg (q = k, j — l, ... —k + 1, —k) is an irreducible tensor operator, how much 
information about its matrix elements in the angular momentum basis can be in- 
ferred? 

To find the answer, let us take the matrix element of the operator equation 
(17.83) between the states ( a'j'm ' | and | otjm). In addition to the angular momentum 
quantum numbers, the labels a and a' signify the totality of all quantum numbers 
needed to specify the basis states of the system completely. We obtain 

k 

{a'j'm' | U R TM | ajm) = 2 {a'j'm' \ Tj?' \ ajm)D%(R) 

q’ — ~k 


Using the definition (17.28) of the matrix elements of U R , we convert this equation 
into the set of simultaneous equations 


2 D«l.(R) {a’j'p'\T k 9 \ajp)D ( ^*(R) = 2 {a'j’m' \Tj?'\ajm)D%(R) (17.86) 


fX/X 


A glance at this equation and (17.75) shows that the two have exactly the same 
structure. If the D are known, the linear homogeneous relations (17.75) determine 
the C-G coefficients for given values of j\,j 2 ,j 3 , except for a common factor. Hence, 
(17.86) must similarly determine the matrix elements of the tensor operator. If we 
identify j = ji, k = j 2 , and j' = j 3 , we can conclude that the two solutions of these 
linear homogeneous recursion relations must be proportional: 


{a'j'm' 

T k 9 

otjm ) = {jkmq \jkj'm'){a 'j ' 

T k || aj ) 


This important formula embodies the Wigner-Eckart theorem. 

The constant of proportionality {a'j' || T k || aj) is called the reduced matrix 
element of the irreducible spherical tensor operator T k . It depends on the nature of 
the tensor operator, on the angular momentum quantum numbers j and j', and on 
the “other” quantum numbers a and a', but not on the quantum numbers m, m' , 
and q which specify the orientation of the system. 

The Wigner-Eckart theorem provides us with a fundamental insight, because it 
separates the geometric and symmetry -related properties of the matrix elements from 
the other physical properties that are contained in the reduced matrix element. Since 
the C-G coefficients are readily available, the theorem also has great practical value. 

From the fundamental properties (17.53) and (17.57) of the C-G coefficients, 
we infer the angular momentum selection rules for irreducible spherical tensor op- 
erators. The matrix element {a'j'm’ \Tg\ ajm) vanishes unless 

q — m' — m 


(17.88) 
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and the numbers, j, j', and k satisfy the triangular condition: 

\j-j'\^k<j+f\ (17.89) 

In particular, it follows that a scalar operator ( k = 0) has nonvanishing matrix ele- 
ments only if m — m' and j — j' . The selection rules for a vector operator (k = 1) 
are 

A m = m' — m = 0, ±1 and A j = j' — j = 0, ±1 
with j = j' = 0 excluded. 

Exercise 17.24. Derive the angular momentum selection rules for an irreduc- 
ible second-rank ( quadrupole ) tensor operator. 

Although (17.83) defines a tensor operator, to test a given set of 2k + 1 oper- 
ators for its rotational behavior and irreducibility, it is much easier and sufficient 
to check this condition for infinitesimal rotations. For these, (17.83) becomes in first 
order, 

k 

[A • J, V} = 2 Tj?\kq'\n ■ J\kq) (17.90) 

q' = -k 

where n points along the arbitrarily chosen axis of rotation. Using the familiar ex- 
pressions (17.27) for the matrix elements of the angular momentum components, we 
derive from (17.90) the standard commutation relations 

(17.91) 


Exercise 1 7.25. Show that by successive application of infinitesimal rotations 
(exponentiation) the argument leading from (17.83) to (17.90) can be reversed, sug- 
gesting (though not proving) the sufficiency of the commutation relations (17.90) or 
(17.91) as a test for an irreducible spherical tensor operator. 

Exercise 17.26. Show that the trace of any irreducible spherical tensor oper- 
ator vanishes, except those of rank 0 (scalar operators). 

Exercise 17.27. Show that the tensor operators S% — (— \yT k q ^ and T§ trans- 
form in the same way under rotation. Prove that 

(a'j || S k || aj) = (aj || T k || a'j)* 

and deduce the identity 

(jkmq\jkjm') = (-1 ) q {jkm' , - q\jkjm ) (17.92) 

Exercise 17.28. If and Tf are two irreducible spherical tensor operators of 
rank k, prove that the contracted operator 

+ k 

X (-W77* 

q=-k 


[J z , m = qhTg 

[J + , m = Vcik - q)(k + q + \)hrr l 
U-> rn = V(fc + q)(k - q + 1 )hrr 1 


(17.93) 
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is a tensor operator of rank zero, i.e., a scalar operator. For k — 1, show that this 
scalar operator is just the inner product S • T. 

Exercise 17.29. Prove that every irreducible spherical tensor operator of rank 
k can be expressed as a linear combination of the basic constituent tensor operators 

T£(a'j\ aj) = J \a'j'm')(jkmq\jkj'm')(ajm\ (17.94) 

mm ' 

& 

with the appropriate reduced matrix elements serving as the coefficients in the linear 
combination. 

Although the Wigner-Eckart theorem has been discussed in this section entirely 
in the context of rotations in ordinary three-dimensional space, the concepts devel- 
oped here are of great generality and can be extended to other symmetries and their 
group representations. 

8. Applications of the Wigner-Eckart Theorem. The static electric moments of 
a charge distribution, such as an atom or a nucleus, are examples of tensor operators. 
They are best defined in terms of the perturbation energy of the particles in an 
external electric field E = — Vf>. In first order, a particle of charge q contributes 

E — q J if/*(r)(f) (r)ifj(r) d 3 r (17.95) 

to the interaction energy. If the sources of the electric field are at large distances, <fi 
satisfies Laplace’s equation and may be written in the form 

oo + k 

H r) = 2 £ AfcVim <p) (17.96) 

k= 0 q= —k 

the origin being chosen at the center of mass of the atom or nucleus. The energy 
can therefore be expressed as the electric multipole expansion, 

E=\ p(r)<Kr) d 3 r = £ Ag f p(r)r k Yf d 3 r (17.97) 

At each order of k, the field is characterized by 2k + 1 constants Ag, and there is 
correspondingly a spherical tensor of rank k, r k Yi?(r), the electric 2 k -pole moment. 
Since spherical harmonics transform irreducibly under rotations in ordinary space, 
the components of these tensors obviously pass the test (17.91) for irreducible spher- 
ical tensor operators, provided that the rotations are generated by the operator 
L = r X p. Their matrix elements may therefore be evaluated by use of the Wigner- 
Eckart theorem. 

Exercise 17.30. Prove that the static 2*-pole moment of a charge distribution 
has zero expectation value in any state with angular momentum j < kl 2. Verify this 
property explicitly for the quadrupole moment by use of the formulas (17.62) and 
the Wigner-Eckart theorem. 

Exercise 17.31. For a particle in a central-force potential, with energy basis 
functions separable in radial and spherical polar coordinates, use (17.74) to evaluate 
the reduced matrix elements of the electric multiple moment operators. 
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Next, as an application of the Wigner-Eckart theorem to a vector operator A 
and to the special vector operator J, we use the relation 


{ a'j'm ' \A q \ ajm) 


( a'j ' 11 A || aj) 
( ot'j ' || J || aj) 


(a'j'm' \j q \ajm) 


(17.98) 


For j' = j this may be further developed by noting that by virtue of the simple 
properties of J we must have for the scalar operator J • A : 

(a'jm\i ■ A j ajm) = C(a'j || A || aj) 

According to the results derived in Section 17.7, the constant C must be independent 
of the nature of the vector operator A and of the quantum numbers m, a, and a ' . 
Hence, it can be evaluated by choosing a' = a and A = J: 

(ajm\j • J | ajm) = j(j + 1 )h 2 = C(aj |j J || aj) 

Substituting the reduced matrix elements of A and J from the last two equations 
into (17.98), we obtain the important result 


(a'jm' \A q \ajm) = ^ (jm'\j 9 \jm) 


(17.99) 


by which the y-diagonal matrix elements of a vector operator A are expressed in 
terms of the matrix element of the scalar operator J • A and other known quantities. 
Since the right-hand side of (17.99) can be loosely interpreted as the projection of 
A on J, this formula contains the theoretical justification for the vector model of 
angular momentum. It can be used as the starting point of the derivation of the matrix 
elements of magnetic moment operators that are important in spectroscopy. 

Exercise 17.32. Prove from the Wigner-Eckart theorem and formula (17.62) 
that the reduced matrix element of J is 

(aj' || J || aj) = Vj(j + 1 )hSjj. (17.100) 

* 

Contributions to the magnetic moment of an atom (or nucleus) arise from the 
orbital motion of the charged particles and the intrinsic spins in the system. Gen- 
erally, the magnetic moment operator of an atom may be assumed to have the form 

^ (8lL + gs S) (17.101) 

2 me 

where m is the electron mass. 7 Since both L and S are vector operators with respect 
to the rotations generated by the total angular momentum J = L + S, p is also a 
vector operator. 

According to the Wigner-Eckart theorem, all matrix elements of p in the an- 
gular momentum basis are proportional to each other. We therefore speak of the 
magnetic moment of the atom and, in so doing, have reference to the expectation 
value 

p = (ajjjpjajj) = (jljO\jljj)(aj || p || aj) = ^ II II «/> (17.102) 

7 Haken and Wolf (1993), Mizushima (1970). For an older, but very thorough, discussion, see 
Condon and Shortley (1935). 
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In the classical limit (j °°) the reduced matrix element becomes identical with 
the magnetic moment p. 

The application of (17.99) to the magnetic moment vector operator (17.101) 
gives immediately 


But 


P = (ajj\fjb z \ajj) = 2mhc J^ ^ («i/|gz.L • J + g s S • J | ajj) (17.103) 


2L J = J 2 + L 2 - S 2 and 2S • J = J 2 + S 2 - L 2 


Hence, 


P = + ~f) + #s)J 2 + (gL ~ £sX L2 “ S 2 ) | ajj) (17.104) 

If L-S coupling describes the state of the atom, for which the magnetic moment is 
to be evaluated, | ajj) is an approximate eigenstate of the total orbital and the total 
spin angular momenta, which are symbolized by a. Using the usual notation J, L, 
and S for the three angular momentum quantum numbers that characterize the atomic 
state, and the values g L = 1 and g s « 2 for the electrons, we obtain for the magnetic 
moment of the atom, 


P = 


ehJ 
2 me 


1 + 


J(J + 1) - L(L + 1) + S(S + 1) 
2 J(J +1) 


eh 
2 me 


gj J (17.105) 


The expression in the bracket (g 7 ) represents the general form for L-S coupling of 
the Lande g-factor. 


9 . Reflection Symmetry, Parity, and Time Reversal. The Euclidean principle of 
relativity may be supplemented by the further assumption that space has no intrinsic 
chirality, or handedness: Processes take place in the same way in a physical system 
and its mirror image, obtained from one another by reflection with respect to a plane. 
We may call this assumption the extended Euclidean principle of relativity. By def- 
inition, a reflection with respect to the yz plane changes x into —x, and leaves y and 
z unchanged; similarly, p x is transformed into — p x and p y and p z remain unchanged. 

In any test of the extended Euclidean principle of relativity, an important dif- 
ference between rotational symmetry and reflection symmetry must be remembered. 
In rotating a system from one orientation to another, we can proceed gradually and 
take it through a continuous sequence of rigid displacements, all of which are equiv- 
alent. This is not so in the case of reflection, since it is impossible to transform a 
system into its mirror image without distorting it into some intermediate configu- 
rations that are physically very different from the original system. It is therefore not 
at all obvious how quantities, seemingly unrelated to coordinate displacements, such 
as the electric charge of a particle, should be treated in a reflection if symmetry is 
to be preserved. Only experience can tell us whether it is possible to create a suitable 
mirror image of a complex physical system in accordance with the assumption of 
reflection symmetry. 

In developing the mathematical formalism, it is convenient to consider inver- 
sions through a fixed origin (r — » — r) instead of plane reflections. This is no 
limitation, since any reflection can be obtained by an inversion followed by a rota- 
tion. Conversely, an inversion is the same as three successive reflections with respect 
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to perpendicular planes. Corresponding to an inversion, there is a unitary operator 
Up with the following transformation properties: 

U P rU}, = -r, U P pUl = -p (17.106) 

If the system is completely described by spatial coordinates, these requirements are 
met by the operator U P , defined by 

U P \r') = | -r'> (17.107) 

Since two successive inversions are the same as the identity operation, we may 
require that 

U 2 P = 1 (17.108) 

choosing an arbitrary phase factor to be 1 . The eigenvalues of U P are ± 1 and are 
said to define the parity of the eigenstate. If the system is specified by r, the eigen- 
states are the states with even and odd wave functions, since 

Up j |r'} d 3 r' {<r'|¥> ± <-r'|^)} = ±J |r'> d 3 r' {<r'|^> ± <-r'|¥>} 

(17.109) 

In the context of one-dimensional systems, these properties of wave functions under 
reflection were discussed in Section 5.2. 

The Euclidean principle of relativity connects physical laws with geometry. We 
expect that rotations and inversions commute. If this is assumed, we have 

[Up, J] = 0 or UUUp = J (17.110) 

0 

confirming that J is an axial vector operator. It follows from (17.110) that the ei- 
genstates | jm) have definite parity and that all substates of a given; value have the 
same parity, since they are obtained by successive application of J + . These conclu- 
sions are illustrated by the discussion of the parity of spherical harmonics in Section 
11.4. Their parity is given by (— l) f . 

A m irror reflection in a plane whose normal is the unit vector n can be thought 
of as an inversion followed (or preceded) by a rotation by an angle ir around the 
axis n. Thus, the unitary operator describing such a mirror reflection is the product 

U P e~ (m * :tv = e~ (i/ii)h37r U P (17.111) 

If the system is a spin 1/2 particle, the inversion operator U P is a multiple of 
the identity, since J = (h/2)cr, and there is no other matrix that commutes with 
every component of o\ Hence, we may choose U P = 1 for the inversion. The operator 
for a mirror reflection, (17.111), reduces to 

e~ KH 2 )ft-«r _ (17.112) 


Exercise 17.33. Show that exp^^^j is (except for a phase factor) the reflec- 
tion operator for the coordinate x, if H is the Hamiltonian of a linear harmonic 
oscillator along the x axis with frequency co. 


Exercise 17.34. A rotation in three-dimensional space can be replaced by the 
product of two mirror reflections whose intersection is the axis of rotation. Working 
with the representation (17.112) of a reflection, construct the product of two such 
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reflections and show that it represents a rotation. Relate the angle of rotation to the 
angle between the mirror planes. 


If the Hamiltonian of a system is invariant under inversion, the parity operator 
U P is a constant of the motion, and all eigenvectors of H may be assumed to have 
definite parity. Parity is conserved for a particle in a central field even in the presence 
of a uniform magnetic field (Zeeman effect), but the presence of a uniform electric 
field causes states of opposite parity to be mixed (see Chapter 18). 

There are parity selection rules for the matrix elements of operators which trans- 
form simply under inversion. An even operator is characterized by the property 

U P AUp — A (17.113) 

and has nonvanishing matrix elements between states of definite parity only if the 
two states have the same parity. Similarly, 

U P BUp = —B (17.114) 


characterizes an odd operator, which has nonvanishing matrix elements between 
states of definite parity only if the two states have opposite parity. For instance, if 
— ex is the electric dipole moment operator, the expectation value of this operator 
is zero in any state of definite parity. More generally, an atom or nucleus in a state 
of definite parity has no electric 2 A -pole moment corresponding to odd values of k. 
If, as is known from the properties of the weak interactions, conservation of parity 
is only an approximate symmetry, small violations of this selection rule are to be 
expected. 

A system is said to exhibit symmetry under time reversal if, at least in principle, 
its time development may be reversed and all physical processes run backwards, 
with initial and final states interchanged. Symmetry between the two directions of 
motion in time implies that to every state ^ there corresponds a time-reversed state 
0^R and that the transformation 0 preserves the values of all probabilities, thus 
leaving invariant the absolute value of any inner product between states. 

From Section 17.1 we know that in determining 0, only unitary and antiunitary 
transformations need to be considered. The physical significance of 0 as the time 
reversal operator requires that, while spatial relations must remain unchanged, all 
velocities must be reversed. Hence, we postulate the conditions 

0r0 _I - r, 0p0" J = -p, 0J0 1 = -J (17.115) 

If the time development of the system is given by 

T'(r) = e ~ (i/h>Ht TO) 

time reversal symmetry demands that the time-reversed state 0^(0) evolves into 

@V(-t) = ©TO) 

From the last two equations we obtain the condition 


e ~(i/fi)Ht Q _ Q e W)Ht 


if the theory is to be invariant under time reversal. 

If © were unitary, condition (17.116) would be equivalent to 


(17.116) 


®H + H® = 0 
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f such an operator 0 existed, every stationary state F^ of the system with energy 
; would be accompanied by another stationary state 0F £ with energy — E. The 
hange of sign of the energy is in conflict with our classical notions about the be- 
avior of the energy if all velocities are reversed, and it is inconsistent with the 
xistence of a lower bound to the energy. Hence, 0 cannot be unitary. 

If 0 is assumed to be antiunitary so that 

©AF = A*0F (17.117) 

nd 

(0F a , ©F fe ) = (F fc ,FJ (17.118) 

Dr any two states, invariance under time reversal as expressed by (17.116) requires 
lat 


0i? — H® — 0 


(17.119) 


although the operator 0 commutes with the Hamiltonian, it is not a constant of the 
lotion because the equation of motion (14.18) holds only for linear operators. 


Exercise 17.35. Deduce the commutation relation (17.119) from (17.116). 

Obviously, a double reversal of time, corresponding to the application of 0 2 to 
11 states, has no physical consequences. If, as is the case in ordinary quantum 
lechanics, a one-to-one correspondence may be set up between physical states and 
idmissible) state vectors, with only a phase factor remaining arbitrary, 0 2 must 
itisfy the condition 

© 2 F = CF 

ith the same constant |C| = 1 for all F. From this condition, owing to the anti- 
aitary property of © the following chain of equalities flows: 

C'Qq/ — @2©>jr — @ 3 qr — @@ 2 \[r = ©OF — C*©F 

ence, C is real and either C = lorC = — 1, depending on the nature of the system. 
I e also see from the identity 

(0F, TO = (@F, © 2 F) = C(©F, F) (17.120) 

tat in the case C — — 1, 

^F) = 0 (17.121) 

lowing that for such systems time-reversed states are orthogonal. 

As a corollary, if C — - 1 and the Hamiltonian is invariant under time reversal, 
te energy eigenstates may be classified in degenerate time-reversed pairs. This 
■operty is known as Kramers degeneracy. 

For a particle without spin, the operator © is defined by its action on the co- 
•dinate basis: 

©|r') = |r'> (17.122) 

ince 0 is antilinear, it is represented by complex conjugation of the wave function: 

(r' 1 0'F) = <r'|T0* = 0*(r') (17.123) 
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in agreement with the conclusions reached in Section 3.5. 

Exercise 17.36. Derive (17.123) from (17.122). Formulate time reversal in 
the momentum representation. 


From the time reversal behavior of angular momentum J, which according to 
(17.115) anticommutes with 0, it is easy to infer that the basis vectors | ajm) trans- 
form under time reversal according to the relation 


@| ajm) = e ,s (—l) m \ aj, — m ) 


(17.124) 


where the real phase constant 8 may depend on j and a but not on m. By applying 
© again to Eq. (17.124) and using the antilinear property of 0, we find that 

© 2 1 ajm) = (— l) 2y | ajm) (17.125) 

Hence, 0 2 = +1 if; is integral, and 0 2 — — 1 if; is half-integral. Kramers degen- 
eracy implies that in atoms with an odd number of electrons (or nuclei with an odd 
number of constituent nucleons, or baryons with an odd number of quarks) the en- 
ergy levels are doubly degenerate. Since it does not destroy time reversal symmetry, 
this remains true even in the presence of a static electric field. A magnetic field 
violates time reversal symmetry and splits the degeneracy. 


Exercise 17.37. Prove Eqs. (17.124) and (17.125). Show that 0 commutes 
with the rotation operator U R — and use this information to derive the 

symmetry relation (17.39) for the representation matrices: 

D$ m *(R) = (-1 (17.39) 


Exercise 17.38. Show that for a particle with spin one-half in the usual a z 
basis a and f3, time reversal may be represented by cr y K , where K stands for complex 
conjugation. 

Exercise 17.39. Show that for a particle of spin one-half, orbital angular mo- 
mentum t, and total angular momentum ;, the eigenstates defined in (17.64) 
transform under time reversal into ±(—l) m the sign depending on whether 

; = € — 1/2 or ; = € + 1/2. 


Exercise 17.40. Show that the single-particle orbital angular momentum ei- 
genfunctions with quantum numbers € and m in the momentum representation are 
spherical harmonics. Show that the choice of phase implied by (p|€m) = i e Y™( p) 
leads to consistent and simple time reversal transformation properties for the angular 
momentum eigenfunctions in momentum space. Compare with the time reversal 
transformation properties of the angular momentum eigenfunctions in the coordinate 
representation. 

Although time reversal invariance does not lead to any conservation law, se- 
lection rules may be inferred just as from rotation and reflection invariance, because 
the important tensor operators usually have simple transformation properties under 
time reversal. An irreducible spherical tensor operator Tfl is said to be even or odd 
with respect to 0 if it satisfies the condition 


©W 1 = ±{-\yTk q 


(17.126) 
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'he + sign refers to tensors that are even under time reversal, and the — sign to 
msors that are odd. 

By taking the matrix element of the operator equation (17.126) between states 
f sharp angular momentum and using (17.124), as well as the antilinearity of 0, 
iq derive: 

(a'j'm'\T k q \ajm) = ±e i(s ~ s '\a'j ' , -m'\T k q \aj, -m)* 

f we confine our attention to matrix elements that are diagonal in a and j, it follows 
rom (17.126), the Wigner-Eckart theorem, and (17.61) that 

(aj || T k || aj) = ±(-l f(aj || T k || aj)* (17.127) 

f the reduced matrix element (aj || T k || aj) is real, it is zero for tensor operators 
tiat have odd rank k and that are even under time reversal. (It is also zero for tensor 
perators of even rank, which are odd under time reversal.) 

Exercise 17.41. Derive the results (17.126) and (17.127). 

Electric multipole moments are even under time reversal. Hence, in sharp an- 
;ular momentum states the expectation values of the electric dipole moment (k = 1) 
nd all other odd-rank electric moments vanish as a consequence of time reversal 
ymmetry. We have already noted that they also vanish for states of definite parity 
s a result of space reflection symmetry. We conclude that the observation of a static 
lectric dipole moment in a stationary state of definite angular momentum can be 
xpected only if both space reflection and time reversal symmetries are violated by 
he dynamical interactions. Since the discovery of parity violation in weak interac- 
ions, the search for experimental evidence for an electric dipole moment in the 
leutron has been motivated by the need to determine the effect of the expected small 
iolation of time reversal symmetry, which has its origin in the interactions of pan- 
icles produced in high-energy interactions. When quantum mechanics is extended 
o (relativistic) interactions, which cause the creation and destruction of particles, 
harge or particle-antiparticle conjugation (C) joins space reflection ( P ) and time 
eversal ( T ) as a third discrete symmetry. It is enforced by the strong and electro- 
nagnetic interactions, but violated in the realm of weak interactions. Some conti- 
nents will be found in Chapter 24. 

Exercise 17.42. Show that the symmetry operations 0 and U P commute and 
lerive the transformation properties of the fundamental dynamical variables (posi- 
ion, momentum, angular momentum) under the action of the combined inversion- 
ime reversal operation of the antilinear operator %U P . 

LO. Local Gauge Symmetry. The applications of symmetry in quantum mechanics 
reated in this chapter have so far been concerned with spacetime transformations 
)f the states. We conclude this chapter with a brief discussion of gauge symmetries. 
Qiese symmetries have come to dominate contemporary theories of particles and 
ields, but they can also be introduced in the framework of ordinary quantum me- 
ihanics. 

Local gauge invariance made its first appearance in Section 4.6, where we saw 
hat the dynamical equations of quantum mechanics for a charged particle in an 
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external electromagnetic field can be cast in gauge-invariant form if the wave func- 
tion and the potential are transformed jointly according to the following scheme: 

if,' = Uif, (17.128) 

- V - - A' = U\- V - - A ]f/ + , ih - - q<f) r = U[ih - - q<f>)u (17.129) 
i c \i c / dt \ dt / 

These relations can also be written as 

if] s* jfl 

A' = UAU f + — U(VU f ), <f>' = Ucf>U f U (17.130) 

q q dt 

We found this local gauge transformation to be implemented by the unitary operator, 

U = e iqf/hc (17.131) 

where /( r, t) is an arbitrary smooth function of the spacetime coordinates. The result 
of this gauge transformation is the mapping 


A i-> A' = A + V/, 


(f) ^ <f>' = <f> - -y, 

C dt 


if, i-> if,' = e iqf/hc ip 


(17.132) 


It has become customary to say that the principle of local gauge symmetry in quan- 
tum mechanics, or invariance of the theory under the transformation (17.128), re- 
quires the introduction of a gauge field (A, <f>), which obeys the rule (17.130) for 
gauge transformations. If gauge invariance is accepted as a fundamental law of phys- 
ics, the electromagnetic field can be regarded as a consequence of this symmetry. 
The local operators U represent the one-dimensional unitary group, U{ 1). This sym- 
metry group is commutative, or Abelian. 

Without any specific reference to electromagnetic interactions. Abelian gauge 
fields also arose in Section 8.6 when we considered the adiabatic approximation for 
molecules. By separately treating the fast electron motion, with the nuclear coor- 
dinates serving as slowly varying parameters, we found that the Schrodinger equa- 
tion (8.107) for the much heavier nuclei generally includes a gauge field. 

In the domain of physics where classical correspondence arguments are un- 
available for constructing the appropriate quantum mechanical laws, the imposition 
of local gauge symmetry serves as a powerful principle from which theories for 
interacting particles can be derived. The basic idea is similar to the scheme embodied 
in (17. 128)— (17. 132), generalized to particles that have multicomponent wave func- 
tions, corresponding to intrinsic degrees of freedom that are unrelated to their space- 
time transformation properties. Typical examples of such intrinsic coordinates are 
isospin or color “quantum numbers” labeling the states. 8 

To illustrate the consequences of the adoption of the gauge symmetry principle, 
we choose the generators of the non- Abelian group 5(7(2) as the observables asso- 
ciated with the intrinsic degrees of freedom. These observables are assumed to com- 
mute with all operators related to spacetime transformations, such as position, mo- 
mentum, and angular momentum. For purposes of identification, we call the intrinsic 
degree of freedom isospin, and for simplicity we assume that the particle has no 
ordinary spin. The wave function is now a two-component column matrix in isospin 


s For further descriptive comments, see Section 21.1. 
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>ace, analogous to the spin space of Chapter 16. We denote by r 1; r 2 , r 3 the three 
ermitian 2X2 isospin matrices, which are the generators of SU(2). They are 
lalogous to the Pauli matrices a x , a y , u z defined in Section 16.4, and their com- 
utation relations are 

[t;, Tj] = 2 is ijk T k (17.133) 

he traces of the matrices t u t 2 , t 3 are zero. Generalizing the previous scheme, we 
istulate invariance of the wave equation for the isospinor i^(r, t ) under the local 
luge symmetry operations induced by the group SU( 2). These operations are rep- 
isented by 

U = e m)a (17.134) 

here U is a unimodular unitary matrix, and 

1 3 

<*(r, 0 = r E 0 T i (17.135) 

4 ,-=i 

a traceless 2X2 matrix with an arbitrary smooth spacetime dependence. The 
iree real-valued local parameters a,(r, t) specify the element of the group. 

The relations (17.128)-(17.130) now generalize to 

tf/’(r, t ) = e (Uh)a(r ' t) t) (17.136) 

y V - G' = t/y V - G^ju\ ih J t ~ g’ = u(ih ~ - g^)c/ + (17.137) 

■, equivalently, 

« 

3f/ t 

G' - UGU f + ihU(VW), g 1 = UgU f - ihU (17.138) 

dt 

ere, G and g are a vector and a scalar in ordinary space, respectively, but they are 
X 2 matrices in isospin space. The gauge-covariant structures in Eqs. (17.137) are 
ie building blocks from which the gauge-invariant dynamical laws are constructed. 

For Lie groups like SU(n), it is sufficient to implement the symmetry conditions 
Y considering infinitesimal transformations (17.134), choosing « 1. For 

iese, (17.138) reduces to 


G' = G + Va + j [a, G], g' ~ g ~ ~~ + T K g\ (17.139) 
n dt n 

hese transformation equations for the gauge fields can be rewritten, if we decom- 
jse the matrices G and g, as 

3 3 

G = “ E G*t< and gi T i (17.140) 

£ i= 1 ^ i= 1 

istead of a single four-vector gauge field represented by the potentials (A, </>), we 
dw have three such fields, (G,-, g ( ), with i = 1, 2, 3. Their infinitesimal gauge 
ansformation properties are obtained by substituting (17.135) and (17.140) in 
7.139), and using the commutation relations (17.133) to give 


_ 1 

d a, 

1 

G* — G € + Vofj - - s { j k aj G fc , 

g( ge df 

^ s tjk a j gk 


(17.141) 
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These transformation equations show explicitly how the gauge fields must transform 
if they are to ensure local gauge invariance of the quantum mechanical wave equa- 
tion. The last terms on the right-hand sides arise from the non- Abelian character of 
the symmetry group and indicate the greater complexity of the new gauge fields in 
comparison with the electromagnetic field. 

Exercise 17.43. For an infinitesimal gauge transformation derive (17.139) 
from (17.138) and work out the transformation equations (17.141). 

The gauge-invariant Yang-Mills field equations for the field tensors derived 
from the gauge (vector potential) fields are analogous to Maxwell’s equations. How- 
ever, because of the non-Abelian character of the theory, they are nonlinear, exhibit 
self-couplings, and pose formidable mathematical difficulties. Further discussion is 
found in textbooks on quantum field theory. 9 Local gauge symmetries associated 
with intrinsic dynamical variables that transform as the generators of various sym- 
metry groups SU(n) are the basis of our present-day understanding of strong and 
weak interactions in particle physics. Similar ideas have also been found useful in 
many-body theories and applications to condensed-matter physics. From the per- 
spective of abstract differential geometry (topology), gauge fields are examples of 
connections, the derived field tensors are interpreted as curvatures, and the line 
integral of the gauge field over a closed loop (like Berry’s phase in Section 8.6) is 
a holonomy. 

Although in this chapter all symmetries were discussed in the context of non- 
relativistic quantum mechanics, the dynamic gauge symmetries as well as the purely 
geometric symmetries can be readily adapted to the formalism of relativistic quan- 
tum mechanics. The equations of this section, in particular, can be transcribed in 
Lorentz-covariant form without modification. This is not wholly surprising, since 
the electromagnetic field and its quantum, the massless photon, are intrinsically 
relativistic. 

Exercise 17.44. Defining for a non-Abelian gauge field theory the analogue 
B of the magnetic field as the 2X2 isospin matrix: 

B = - E B,r, = ( V - ~ G j X G = V X G - 7 G X G (17.142) 
2 i= i \ ft J ft 

prove that under a finite gauge transformation, 

B' - UBW (17.143) 

and that the definition (17.142) ensures that the “magnetic” energy, 

1 3 

- 2 B, • B* = trace(B • B) (17.144) 

2 ;=i 

is gauge invariant. Similarly, construct the analogue of the electric field E. 

Exercise 17.45. If instead of SU( 2) the group SU( 3) induces the local gauge 
transformations of the system, how many gauge fields would there be? (Refer to 
Exercise 17.5.) 


9 Gross (1993). For an elementary introduction, see Halzen and Martin (1984). 
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roblems 

In the notation of (17.64) the state of a spin one-half particle with sharp total angular 
momentum j, m, is: 

aWjl 1/2 + 1/2 

Assume this state to be an eigenstate of the Hamiltonian with no degeneracy other 
than that demanded by rotation invariance. 

(a) If H conserves parity, how are the coefficients a and b restricted? 

(b) If H is invariant under time reversal, show that alb must be imaginary. 

(c) Verify explicitly that the expectation value of the electric dipole moment 
— er vanishes if either parity is conserved or time reversal invariance holds (or both). 

A particle with spin one-half (lambda hyperon) decays at rest (€ = 0) into two par- 
ticles with spin one-half (nucleon) and spin zero (pion). (See Figure 17.2.) 

(a) Show that, in the representation in which the relative momentum of the decay 
products is diagonal, the final state wave functions corresponding tom = ±1/2 may, 
because of conservation of total angular momentum, be written in the form 

(p|i l) ~ A s a + A P { cos 9 a + e ,<p sin 9 /3) 

<p|i, “I) = A s /3 - A P ( cos 9 1 3 - e~ i<p sin 9 a) 

where 9 is the angle between the momentum vector p of the spin zero particle and 
the z-axis of quantization. (Neglect any interactions between the decay products in 
the final state.) ’ 

(b) If the initial spin one-half particle is in a state with polarization P ( f P [ < 1), 
show that the (unnormalized) angular distribution can be written as W — 1 + AP • p. 
Evaluate A in terms of A s and A p . 

From the results of Problem 2, construct the density matrix p for the spin state of the 

i d 

final spin one-half particle (nucleon). 


Pion 



gure 17.2. Kinematics of the decay of a spin-polarized hyperon at rest into a nucleon 
d a pion. The polarization vector is P. The magnitude of the relative momentum p 
uals that of the momenta of the two decay products in the rest frame of the hyperon. 
e unit vector p„ points in the direction of emission of the nucleon. 
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(a) Prove that the polarization of the spin one-half decay products can be written 
in the form 

<<t> = trace(pcr) = ^ [-(A - p„ ■ P)p„ + ^p„ X P + v(p n X P) X pj 

where p„ is the unit vector in the direction of emission of the spin one-half particle, 
and P denotes the initial polarization of the decaying particles. Determine the real- 
valued quantities p and v in terms of A s and A P . 

(b) Show that A 2 + p 2 + v 2 = 1. 

(c) Discuss the simplifications that occur in the expressions for the angular dis- 
tribution and final state polarization if conservation of parity or invariance under time 
reversal is assumed for the decay-inducing interaction. 

4. The Hamiltonian of the positronium atom in the IS state in a magnetic field B along 
the z axis is to gopd approximation 

H = AS t • S 2 + (5 1 ! — S 2 ) 

me 

if all higher energy states are neglected. The electron is labeled as particle 1 and the 
positron as particle 2. Using the coupled representation in which S 2 = (Si + S 2 ) 2 
and S z = S u + S 2z are diagonal, obtain the energy eigenvalues and eigenvectors and 
classify them according to the quantum numbers associated with constants of the 
motion. 

Empirically, it is known that for B — 0 the frequency of the 1 3 S — > l 1 ^ transition 
is 2.0338 X 10 5 MHz and that the mean lifetimes for annihilation are 10 10 s for the 
singlet state (two-photon decay) and 10 -7 s for the triplet state (three-photon decay). 
Estimate the magnetic field strength B which will cause the lifetime of the longer 
lived m — 0 state to be reduced (“quenched”) to 10“ 8 s. 

5. An alternative to the usual representation for states of a particle with spin one-half, 
in which the simultaneous eigenstates of r and c r z are used as a basis, is to employ a 
basis spanned by the simultaneous eigenstates of r and k = tr ■ r/2. Show that the 
operators S 2 , J 2 , J z , k commute and that their eigenfunctions may be represented as 

<r K'\jmK") D ( £.(cp, 0)8 K . K .. 

where <p, 0 denotes the rotation that turns the z axis into the direction of r. Can this 
representation be generalized to particles of higher spin? Can an analogous basis be 
constructed in the momentum representation? 10 

6. The magnetic moment operator for a nucleon of mass m n is jut = e(g ( L + g s S)/2m n c, 
where g { = 1 and g s = 5.587 for a proton, g ( = 0 and g s = —3.826 for a neutron. 
In a central field with an additional spin-orbit interaction, the nucleons move in shells 
characterized by the quantum numbers £ and j = £ ± 1/2. Calculate the magnetic 
moment of a single nucleon as a function of j for the two kinds of nucleons, distin- 
guishing the two cases j — £ + 1/2 and j — £ — 1/2. Plot j times the effective 
gyromagnetic ratio versus j, connecting in each case the points by straight-line seg- 
ments ( Schmidt lines). 

1. The angular momentum operator or generator of infinitesimal rotations may be rep- 
resented in terms of Euler angles, which specify the orientation of a rectangular 


10 The helicity operator a ■ p will be encountered in Chapter 24. See also Section 35.3 in Gottfried 
(1966). 
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coordinate system Ox'y'z' anchored in a rigid body (Figure 17.1). In this represen- 
tation, the components of angular momentum along the z, y", and z' axes are 


hd_ r _hd_ _ ft _3_ 

i da ’ y i 3/3 ’ 2 i by 


(a) Prove that the components of angular momentum along the x and y axes can 
be expressed in terms of the Euler angles as 


h 

Jy 


h ( . d d COS ad ' 

— ( —sin a cos a cot B 1 — : 

i \ 3/3 da sm (3 dy/ 


h 


sin a d 


— cos a — — sin a cot B 1 — : 

j t •y n r' «\ n ^ . . 


3/3 


da sin /3 dy j 


(b) Check the commutation relations for the operators J x , J y , J z . 

(c) Work out the operator J 2 in terms of the Euler angles. 

(d) Using relation (17.37), show that the matrix elements D^ m (a, /3, y) are ei- 
genfunctions of the differential operators J z , J Z ' and J 2 with eigenvalues m'h, mh, 
and £(€ + l)h 2 , respectively. 


In the quantum mechanical treatment of a rigid body with principal moments of in- 
ertia, Ii, / 2 , and / 3 , it is convenient to use Euler angles as the coordinates to specify 
the orientation of the system. If the z' axis is the axis of a symmetric top, its Ham- 
iltonian is 


H = ~ 3i) + 


Using the results of Problem 7, obtain the energy eigenvalues and the corresponding 
eigenfunctions.’ 

A system that is invariant under rotation is perturbed by a quadrupole interaction 

2 

V = 2 C q 7? 

q= 2 

where the C q are constant coefficients and T '§ are the components of an irreducible 
spherical tensor operator, defined by one of its components: 

^ - (7, + ijyf 

(b) Deduce the conditions for the coefficients C q if V is to be Hermitian. 

(b) Consider the effect of the quadrupole perturbation on the manifold of a de- 
generate energy eigenstate of the unperturbed system with angular momentum quan- 
tum number j, neglecting all other unperturbed energy eigenstates. What is the effect 
of the perturbation on the manifold of an unperturbed j — 1/2 state? 

(c) IfC ±2 = C 0 and C ±1 = 0, calculate the perturbed energies for a j = 1 state, 
and plot the energy splittings as a function of the interaction strength C 0 . Derive the 
corresponding unperturbed energy eigenstates. 
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In Section 8.3, we introduced the elements of perturbation theory for the 
time-independent Schrodinger equation. We are now ready to study in 
greater generality and more detail a successive approximation scheme for 
the discrete bound energy eigenstates of the Hamiltonian H, known as 
Rayleigh-Schrodinger perturbation theory. Other iteration schemes have 
been devised in the search for more effective convergent perturbation 
methods, especially in many-body theory, but the Rayleigh-Schrodinger 
version serves our goal: to obtain a manageable algorithm for approximate 
energy eigenvalues and stationary states and to illustrate the theory with 
applications in atoms. We also return briefly to the variational method for 
stationary states and its relation to perturbation theory. 

1. The Perturbation Method. The Rayleigh-Schrodinger perturbation method ap- 
plies to the discrete energy levels of a physical system whose Hamiltonian operator 
H can be partitioned into two Hermitian operators: 

H = H 0 + gV (18.1) 

Of these, H 0 will be regarded as the unperturbed part and gV as the perturbation. In 
(18.1), g is a real number, which will be used provisionally for bookkeeping pur- 
poses as an expansion parameter. We can let it become zero, in which case the 
Hamiltonian reduces to the unperturbed one, H 0 , or we may let it grow to its full 
value, which may be chosen as g — 1. (In Section 8.3, g was assumed to have its 
full value throughout, and therefore g did not appear explicitly. In later sections of 
this chapter, we will also set g — 1 .) 

The eigenvalues and eigenvectors of H are functions of g. Simple perturbation 
theory applies when these eigenvalues and eigenvectors can be expanded in powers 
of g (at least in the sense of an asymptotic expansion) in the hope that for practical 
calculations only the first few terms of the expansion need be considered. 

The eigenvalue problem we wish to solve is 

HV n = E n V n (18.2) 

We suppose that the related unperturbed eigenvalue problem, 

(18.3) 

has already been solved. Assuming, first, that no degeneracy has occurred here, let 
us inquire what happens to the energy eigenvalues and the corresponding eigenvec- 
tors as we allow g to grow continuously from zero to some finite value. In this 
process, the energy will change to 

E n = ET + A E n (18.4) 

and the eigenvector will change to 

(18.5) 
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jrturbation theory assumes that the changes A E n and AW n are small, where “small” 
ust yet be precisely defined. We substitute expressions (18.1), (18.4), and (18.5) 
to (18.2) and project the equation in the direction of the unperturbed eigenvector 
^ 0) . Using Eq. (18.3), we thus deduce for the energy shift of the nth level: 


A£„ = 


QP<°>, gVy n ) 

^n) 


(18.6) 


his is an exact relation, but it contains the unknown perturbed eigenvector in 
ie inner products on the right-hand side. 

The fundamental idea of perturbation theory is to assume that both the eigen- 
ilues and eigenvectors of H can be expanded in powers of the perturbation param- 
er g, and to determine the coefficients in the perturbation expansions: 

E n = E ( r f + gElf + g 2 E < 2) + . . . (18.7) 

id 

W n - ^< 0) + gW™ + g 2 V™ + ... (18.8) 

i writing (18.8), we have assumed that the unperturbed energy level, £* 0) , is non- 
egenerate. If there is degeneracy, the perturbation method that is based on the 
tpansion (18.8) is still appropriate, provided that for some reason (such as sym- 
letry properties) it is known beforehand to which specific state — *■ in the 

ibspace of the unperturbed eigenstates the perturbed state reduces as g 0. When 
ich knowledge is unavailable, the discussion must be refined and degenerate per- 
irbation theory used (Section 18.5). 

The expansion (18.8) of the eigenvector l 'P„ in powers of g is not unique, for if 
r n is a solution of (18.2), then ( a 0 + a x g + a 2 g 2 + . . .)^„, which differs from ''T„ 
y a constant factor (global gauge transformation), is an equally good solution, but 
s expansion in powers of g may be entirely different from (18.8). In spite of this 
pparent ambiguity in the eigenvectors, all physically observable quantities, such as 
ie energies (18.7) and other expectation values of operators calculated from M/„, 
ave unique expansions in g. 

If in (18.6) , \P‘„ is approximated by its leading term, the known unperturbed state 
r( f ) , we obtain the first approximation, 


A E„ 



Qgg, VT^ 0) ) 

(-v|f(0), ^r(0)^ 


(18.9) 


'hich, if the normalization is accounted for, is the same result as formula (8.33), 
lough in a different notation. 

To go beyond the first approximation in a systematic fashion, we substitute the 
vo expressions (18.7) and (18.8) into (18.2) and compare terms of equal power in 
to obtain the successive approximation equations: 

HJVf = £f^< 0) (18.10) 

+ VV ( n 0> = Ef^f + Efi^T (18.11) 

HfiV (2) + + Ef ) y { fi ) (18.12) 
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Of these equations, (18.10) is identical with (18.3) and gives nothing new. Equation 
(18.11) can be rewritten as 


(Ho - )¥< 1} = (E ( ,p - Wf 


(18.13) 


The right-hand side of this equation is known except for the value of E^\ the un- 
known vector stands on the left. Thus (18.13) is an inhomogeneous linear equa- 
tion for If We try to solve Eq. (18.13) for ' V F^ 1) by dividing by the operator 
H 0 — Ef \ we encounter an obstacle: H 0 — E^ has zero eigenvalues and is therefore 
not invertible. It is worthwhile to review the properties of equations like (18.13), 
since all successive approximations and many other problems of quantum mechanics 
lead to the same type of equation — whether they are formulated as linear equations 
in a vector space, as matrix equations, or as differential or integral equations. 

2. Inhomogenous Linear Equations. Assuming temporarily that the right-hand 
side is completely known, (18.13) is of the general type, 

Au = v (18.14) 

where A is a given Hermitian operator with a complete set of eigenvectors, v is a 
given vector, and u is sought. For our purposes, it is quite immaterial whether A is 
represented by a square matrix, in which case u and v are column matrices, or by a 
differential operator, so that u and v are ordinary functions. We can treat this equa- 
tion in a general abstract vector space, where u and v are vectors, thus encompassing 
many special cases simultaneously. But to most readers the theorems to be stated 
will be familiar from the theory of systems of simultaneous linear equations, where 
A is a finite dimensional square matrix. 

The alternatives to be distinguished are these: 

1 . Either the homogeneous equation 

Au' = 0 (18.15) 

possesses nontrivial solutions, that is, A has zero eigenvalues, or 

2. Equation (18.15) has no nontrivial solution. In this latter case (which cor- 
responds to det A =£0 for finite dimensional matrices), the operator A has a unique 
inverse A -1 (see Section 9.3). For any given v, the solution of (18.14) is uniquely 

u = A -, u (18.16) 

and the problem before us has been formally solved. 

In case 1, there is no linear operator inverse of A, because (18.15) cannot be 
inverted (since any operator acting on the null vector produces only the null vector). 
On the other hand, if in this case (18.14) has a particular solution f, 

Af = v (18.17) 

then any vector 

u = u' + f (18.18) 

will automatically also be a solution. By letting u' symbolize all solutions of the 
homogeneous equation (18.15), u of (18.18) represents all solutions of the inhomo- 
geneous equation, and it is called the general solution of (18.14). 
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In this case, however, the existence of a solution depends on a further necessary 
and sufficient) condition: v must have no component in the subspace spanned by 
he eigenvectors u' that correspond to zero eigenvalue, as seen from the equations 

(u' , v ) = (n\ Au ) = ( Au ' , u) — 0 (18.19) 


!f P denotes the projection operator that projects every vector into the subspace 
ipanned by the eigenvectors u ' , condition (18.19) can be written as 



(18.20) 


kVe will make essential use of this condition. 

Although guessing a particular solution / is perfectly acceptable, we can con- 
struct one by the following systematic procedure. By condition (18.20), we may 
write the equation to be solved, (18.17), as 


Af = (7 - P)v (18.21) 


which is seemingly a needless complication of the simple equation (18.17). Yet, 
ilthough A, which has zero eigenvalues, has no inverse, there exists a Hermitian 
Dperator K such that 



(18.22) 


The operator K is thus the inverse of A in the orthocomplement of the space of the 
vectors u ' . Infinitely many operators K satisfy this equation, because to any solution 
we may add an .operator PB ( B arbitrary) and still have a solution of (18.22). We 
remove this ambiguity and select a unique solution of (18.22) by imposing the sub- 
sidiary condition 

Ku' = 0 (18.23) 

The unique operator that is defined by (18.22) and (18.23) may symbolically be 
written as 

1 - P 

K = (18.24) 

A 

This expression is not intended to imply that A has an inverse, and it is not permis- 
sible to apply the distributive law to it and write it as A -1 — A~ 1 * * * * * * * * X P, or the like. In 
the context of the linear differential equations of mathematical physics, an operator 
like K can be represented by a Green’s function, G(r, r') — (r|£jr') (see Section 
13.3 and examples in the next section). 

The chain of equalities, 

A(Kv) = (AK)v = (i — P)v = v 

shows that 

f = Kv (18.25) 


is a particular solution of (18.14). We thus conclude that if Au' = 0 possesses 
nontrivial solutions, then Au = v has the general solution 


u — u' + Kv 


(18.26) 


nmwiHpH that P ji = fl Wp M/ill nnw nnnlv thpsp results tn np.rturhation theorv 
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Exercise 18.1. Show that the condition (18.23) implies that 

PK = KP = 0 (18.27) 

Exercise 18.2. For case 1 show that / = Kv is the unique particular solution 
of (18.14) that is orthogonal to the subspace of the u' . 


3 . Solution of * the Perturbation Equations. The first approximation of the 
Rayleigh-Schrodinger perturbation procedure has led us to the inhomogeneous equa- 
tion (18.13) or: 

(ET - H 0 )V™ = ~{E ( n l) - (18.28) 

Identifying the operator E < 0) — H Q - A, we see that the homogeneous equation has 
indeed nontrivial splutions which we now assume normalized to unity. We 
have case 1 of the last section before us. We fix our attention on a definite nonde- 
generate unperturbed state with a particular value of n, and we denote by P n the 
projection operator for the direction Condition (18.20) appears now as the 
equation 

p„(ze - wf = o 


But 


Hence, 


P — \Jf(0) 

£-0)^(0) = p n y, jr( 0 ) 


(18.29) 

(18.30) 


and, with the assumed normalization of 

E ( n l) = (^ 0) , P n Vyf ) \ 

Since P n is Hermitian, this first-order correction to the energy is thus (omitting g) 



(18.31) 


where V nn is a diagonal matrix element in the representation spanned by the unper- 
turbed energy eigenstates. It is the mean value of the perturbation in the unperturbed 
state, in agreement with the result of Section 8.3 and with (18.9). 

By inserting (18.31) into (18.28), we can solve the latter by comparing it with 
the general solution (18.26). Since is assumed to be the unperturbed eigenvec- 
tor, it is evident that the solution (18.28) is 


¥£> = - K n (JE™ - V)¥<® 

where C ( „ l) is an arbitrary constant. We must remember that the operator K n is defined 
by the conditions (18.22) and (18.23), or 


(£,? - H 0 )K n = 1 - P n and K n V®> = 0 


(18.32) 


Therefore, we have 

^(D = c uHp(0) + K n W* (18.33) 

It is simplest to set C^ 1} and all other arbitrary constants C®, which multiply fiF 0) in 
the jfet.h armroximatinn. ennal tn zero 
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With C = 0, i.e., = 0, we have 

V ( n l) - K n W™ 


(18.34) 


rhe operator K n is obtained by solving equations (18.32); many techniques are avail- 
able for this task. If (18.34) is used in '\P'£ )) + this approximate eigenvector 

s normalized to unity up to first order in g. 


Exercise 18.3. If is given by (18.34), prove that 

on 0) + g'K l \ v™ + g^ 1} ) = 1 + 0(g 2 ) 


The same procedure can be continued systematically as the perturbation theory 
s carried to higher orders. In second order, we must solve (18.12), or 

(£f - = ~{E^ - VW™ - (18.35) 

vhich is again of the inhomogeneous type. The homogeneous equation, obtained by 
•eplacing the right-hand side by zero, again has nontrivial solutions (case 1), ^^ 0) . 
Hence, we must require that the inhomogeneity have no component in the direction 
bf ' V P^ I) or that its inner product with vanish: 

OP< 0) , (E< 1} - V)¥< 1} + E^YT) = 0 

since is orthogonal to ^^ 1} , and T f ^ 0) is normalized to unity, we obtain the simple 
•elation 



(18.36) 


Armed with knowledge of E%\ we can now calculate the eigenvector correction 
Y {2} by applying the same method as before. We find 

V = CW - K n (E™ - 

\gain we choose C {2) = 0, i.e., PJY^P = 0. Substituting (18.34) forH^ and (18.31) 
:o replace E%\ we are finally able to write 'H r ^ 2) in terms of unperturbed eigenstates 
is 

- -E^KlV^f + K n VK n V'Y < ° ) = -KlVP n V'V ( °> + K n VK n V'Yf> (18.37) 

The perturbation theory can be further developed in this way to any desired 
brder by successive approximation. The equations rapidly become lengthy and cum- 
bersome, but diagrammatic techniques and algebraic computer programs can lighten 
:he work appreciably. 

If we choose all eigenstate corrections HZ’® for k > 0 to be orthogonal to the 


mperturbed or 

;he normalization 

on o> , = s ki 

3 (18.38) 



(^n\ = 1 

(18.39) 


lolds to all orders in g. Obviously, with these choices the perturbed state 1 J r „ is 
generally not normalized to unity. 
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According to Eqs. (18.10)-(18.12) and their higher-order sequels, the ortho- 
gonality condition (18.38) implies further than 



(18.40) 


The challenge of perturbation theory is to determine the operator K n from the im- 

perturbed Hamiltonian H 0 . 

& 

Exercise 18.4. Renormalize to unity and calculate to second order 

in g, 


z n = op„, - | en°\ z ]! 2 ^ n ) p 


(18.41) 


which is the probability of finding the perturbed system in the unperturbed energy 
eigenstate. Show to this order that the value of Z does not depend on the choice of 
the arbitrary constants C„\ 


Exercise 18.5. With the choice of and adopted in (18.34) and (18.37) 
(C™ = C (2) = 0), show that 

(¥„, VV n ) = E™ + 2 gE™ + 0(g 2 ) (18.42) 

Also evaluate (^„, flo^n) to second order in g. Does the sum of Hfffrn) and 
(¥„, gW n ) give E n to second order in g? Does Z„[(^ n , HfPf) + 0R„, gV'RJ] give 
E n to second order in g? 


Exercise 18.6. Obtain expressions for the third-order correction to the energy 
and the eigenvectors in terms of the lower-order corrections. 


To make this more concrete, we specialize to a particle moving in one dimen- 
sion with unperturbed potential V 0 (x) and appropriate boundary conditions, per- 
turbed by the potential V(x ) (with g = 1). In the coordinate representation, the 
inhomogeneous equation (18.28) becomes 

(£2 - f? V 0 (x) + ftj'ltiXx) = I? [V(x) ~ £^]</4 0) « (18.43) 

where we have set E ( f } = h 2 k 2 l2m. The operator K n is represented by the Green’s 
function 


G n (x, x') 


2m 


(x\ K n \x') 


(18.44) 


which, by (18.32), satisfies the differential equation 1 



2m 


V 0 (x) + k 2 n }G n (x, x') = 8(x - x') - ^\x)^*(x’) 


(18.45) 


This equation must be solved subject to the same boundary conditions as the original 
Schrodinger equation and the orthogonality constraint (18.23), or 


•+ 00 

ip ( f ) *(x)G n (x, x') dx = 0 

J — 00 


(18.46) 


See Courant and Hilbert (1953), Section V.14. 
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"he delta function on the right-hand side of (18.45) implies a discontinuity of the 
irst derivative of G(x, x ') at x = x', but the Green’s function itself is required to 
ie continuous. Equation (18.34) then turns into the integral expression 

f +0 ° 2m 

*A (1) « = G n (x, x ') V(x')^°\x') dx' (18.47) 


Exercise 18.7. Consider a particle in a one-dimensional box with V 0 (x) = 0 in 
he domain 0 < x < L. The unperturbed eigenfunctions are </^ 0) (x) = V2/Z sin k n x, 
nd the eigenvalues are k n L — mr (n = 1, 2, . . .). The Green’s function 
r n {x, x') = G„(x\ x) is 2 


r„(x, X') = 


1 

k n L 


x cos k n x sin k n x' — (L — x') cos k n x' sin k n x — — sin k n x sin k n x' 

2k 


or x' > x. If the perturbation is a delta-function potential at the center of the box, 
'(x) — ~g8 (x — LI 2), (a) show that only eigenstates with n = odd are perturbed, 
b) calculate the first-order correction to the energy eigenfunction at x = LI 2, and 
lerive the perturbation expansion for the energy, 


h 2 n 2 TT 2 2 g 
2 mL 2 L 


2tng 2 

h 2 n 2 TT 2 


+ 0(g 3 ) 


(18.48) 


f n is an odd integer. 

Exercise 18.8. Treat the problem of the particle in a one-dimensional box 
>f length L, centered at x = 0 with perturbation V(x) = — g8(x), exactly. Define 
l n — h 2 k' n z /2m and show that for the even-parity energy eigenfunctions the eigen- 
values k' n are the roots of the transcendental equation 

^ = arctanf— ) * (18.49) 

2 \ mg ) 

(a) Discuss the solutions of this equation graphically and, by use of the power 
eries expansion of arctan x in inverse powers of x, show that, to second order in g, 
he energy eigenvalues agree with (18.48). 

(b) For an attractive delta-function potential ( g > 0), show that if gL > 2 h 2 lm, 
in additional bound state with E < 0 arises as the root of 

tanh — = — — (18.50) 

2 mg 

vhere E = —ti 2 K 2 l2m. (Compare with the calculation in Section 6.4.) 


If K n cannot be determined directly, Eq. (18.32) can be solved by the eigenex- 
>ansion, 


K n = 2 


e*. 


J E {0) 


£(°) 


(18.51) 


2 Morse and Feshbach (1953), Section 5.2, reviews the theory of ordinary inhomogeneous linear 

liffat*antinl armotionc 
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as is verified by substitution and noting that H 0 P t = Ef } P ( . From 

p k vv™ = w™) = W* 

and Eqs. (18.34) and (18.51), it follows that 


4 


?-?> = 2 v 

k+n 


Vkn 

E? - E? 


(18.52) 


This is the first-order correction to the nth eigenvector in terms of unperturbed ei- 
genvectors. We use the concise notation V kn for the matrix element of V with respect 
to the unperturbed energy eigenstates. 

Substituting (18.34) or (18.52) into formula (18.36), we get for the second- 
order correction to the energy: 




nk v kn 


£?’ = on®. = 2 £(m _ Ep 


= 2 

k+n 


V, 


nk 


£' 0) - Ei 0) 


(18.53) 


It is evident from (18.52) and (18.53) that for the Rayleigh-Schrodinger per- 
turbation expansion to converge rapidly it is necessary that 


Vnk 

E? ~ E \ ® 


« 1 


(18.54) 


The differences of unperturbed energies that make their appearance in the denomi- 
nators of all of these expressions signal to us once more that degenerate unperturbed 
energy levels must be treated separately — unless the matrix element of V connecting 
two such states happens to vanish. 


Exercise 18.9. Calculate the perturbed energies up to the second-order 
correction is® for the delta-function perturbation located in the center of a one- 
dimensional box (Exercises 18.7 and 18.8). Show that the equality of £® to the 
earlier result is assured by the identity 3 


2 


t=i(e*n) n 


- e 


i 

4 n 2 


(n and i = odd) 


Exercise 18.10. Express (18.37) in terms of unperturbed energies and matrix 
elements of V in the representation of the unperturbed energy eigenstates. 


4. Electrostatic Polarization and the Dipole Moment. As an example, we con- 
sider an electron bound in an atom and placed in a weak uniform constant external 
electric field E. The field can be derived from an electrostatic potential 

4>(j) = — E • r 

where the coordinate origin is most conveniently chosen at the position of the nu- 
cleus and the perturbation potential is 

V — -etf) — eE ■ r 


3 This follows from formula 0.237 (No. 4) in Gradshteyn and Ryzhik (1965). 
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Unless the Green’s function G„(r, r') corresponding to the unperturbed energy 
E ^ 0) can be obtained in closed form, the energy of the system to second order is 
lsually expressed in terms of the expansion formula of the last section as 


E n = Ef + eE 


+ e 2 2 

k¥=n 


(E • r„ fc )(E • r kn ) 
E <°> ~ Ef 


(18.55) 


Adhere all matrix elements of the operator r are to be taken with respect to the 
mperturbed states. 

The shift of energy levels in an electric field is known as the Stark effect. The 
irst few terms of the perturbation expansion give accurate results for applied fields, 
vhich are small compared to the internal electric field of the atom. Since the latter 
s in order of magnitude given by |E| 10 11 V/m, in practice this condition is well 

iatisfied for applied static fields. Successive terms in the perturbation expansion 
iecrease rapidly and uniformly, except that some terms may vanish owing to certain 
symmetry properties of the system. 

If the unperturbed electron is in a central field, H 0 is invariant under coordinate 
nversion through the center of force, and the energy eigenstates may be taken to 
lave definite parity. We saw in Section 17.9 that the expectation value of the op- 
;rator r, which is odd under reflection, vanishes for states of definite parity. Hence, 
he external electric field can, in general, produce no first-order, or linear, Stark 
sffect. An exception arises if the central field is a pure Coulomb field (hydrogenic 
itoms) because the excited states of such atoms exhibit degeneracy of energy ei- 
genstates with opposite parity. Superposition of such states yields energy eigenstates 
hat have no definite parity, and the expectation value of r need no longer vanish. 
tVe will resume discussion of the linear Stark effect of the first excited energy level 
if hydrogen as an example of degenerate perturbation theory in Section 18.6. The 
nevitable degeneracy of the magnetic substates for states of nonzero angular mo- 
nentum, on the other hand, does not affect our conclusion concerning the absence 
if the linear Stark effect, because all these substates have the same parity (Section 
17.9). 

Usually, then, the first-order term in (18.55) is absent. The second-order term 
fives rise to the so-called quadratic Stark effect. If the electric field is along the z 
ixis, the quadratic Stark shift is given by the formula 

En = Ef + e 2 E 2 2 wi^Lo) (18.56) 

k#n tL n 


Dnce again, the parity selection rule ensures that the trivial degeneracy of magnetic 
ubstates does not interfere with the applicability of this formula, because two states 
hat differ only by their magnetic quantum number have the same parity. By in- 
fection of the work of Section 18.3, we see that such apparently indeterminate (0/0) 
erms are simply omitted from the sum in (18.56). 

Perturbation theory may also be used to calculate the expectation value of the 
;tatic electric dipole moment, —ex, in a stationary state of the one-electron atom. In 
he lowest approximation. 



(18.57) 


This is called the permanent electric dipole moment of the system because it rep- 
esents a vector which is determined by the unperturbed state of the system and is 
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entirely independent of the applied field. It vanishes for all states that possess def- 
inite parity. Conversely, its presence, as determined by measurement, signals either 
the occurrence of degeneracy or a breakdown of reflection symmetry. 

A better approximation is obtained by using the correction 


+ gM 


(D|2 


!*2T + 2 


( 0 ) 


E r 


(18.58) 


kn 


k&n 


E?) - E< 0) 


+ eW 2 


E 


l nk 


k+n 


e< 0) - eT 


The last two terms describe the electric polarization of the atom by the applied field. 
In this approximation, we obtain for the dipole moment of the one-electron atom: 


= ~ e i 


j3 _ 2 X 1 ^nk^kn "b ^krXnk T. 

pr 4*r = p 0 - e 1 2 j • E 


Ef-Ef 

where the last term represents the induced dipole moment in the state n. 


(18.59) 


P! = -e 2 2 


r nk r kn + r kn T, 


nk 


k&n 


E ®» - Ei°> 


E = a • E 


(18.60) 


This equation defines a tensor (or dyadic) of polarizability for the state n. 


2 V ^ni^kn ~b Yfcn ^nk 

6 L Ef> - E<°> 


(18.61) 


The polarizability tensor is symmetric, i.e., a tj = a n . In many applications, we find 
a xy = a yz = a = 0, and a xx = a yy = a zz , so that the polarizability is a scalar. 


Exercise 18.11. Calculate the polarizability of an isotropic harmonic oscillator 
from (18.61), and verify that the result agrees with an exact calculation of the in- 
duced dipole moment. 

Note that 


E • p = -J i/f*gVi// n d 3 r= E • p 0 + E • a • E + 0(E 3 ) (18.62) 

Comparing this with (18.42), we obtain 

E„ = Ef - E • p 0 - 4 E • a • E + . . . (18.63) 

which upon substitution of p 0 and a can easily be seen to be identical with expres- 
sion (18.55). The factor 1/2 that appears in the energy (18.63) owing to the induced 
dipole moment is the same as that customarily found when “stress” (E) and 
“strain ” (Pi) are proportional, as is the case in the approximation leading to (18.60). 

The accurate evaluation of the sums over unperturbed states, which are en- 
countered in higher-order perturbation calculations, is usually a difficult problem. 
Sometimes special techniques allow us to avoid the use of the unperturbed eigenstate 
expansion. As an example, consider again the first-order perturbation equation 
(18.13), 

(Ef - Ho)*™ = (V ~ E£>)^f 

and assume that it is possible to find, by whatever procedure and perhaps just guess- 
ing, an operator E„ that satisfies the equation 

(Ef - H 0 )F a V™ = [E„, = (V - E'W 


(18.64) 
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By inspection, we see that the particular solution ^ 1;> of (18.13) that is orthogonal 
to 'T 40) , and thus of paramount interest, is 

W = (/ - P„)F„^<°> (18.65) 


Correspondingly, the second-order energy shift is 

E m = (qr m V(I _ Pn)Fn ^) = (n\VF n \n) - (n\V\n){n\F n \n) (18.66) 

if - | n) is normalized to unity and the bra-ket notation is employed. The use- 
fulness of this method hinges on our ability to determine the operator F n . 

We illustrate this method by applying it to the level shift caused by a uniform 
electric field in the ground state |0) of the hydrogen atom. Since = <0| V|0) = 0 
in this case (no static dipole moment), it is easy to determine an operator F 0 from 
the equation 

[Fq, Hq\ 1 0) = e|E|z|0) (18.67) 


as a function of spatial coordinates only (and not the momenta). In the coordinate 
representation, this equation for F 0 becomes a differential equation that may be 
solved by separation of variables, with the result: 



e | E | ma 
h 2 


+ a )z 


(18.68) 


where a is the Bohr radius. 

The remaining expectation value in (18.66) is easily calculated by noting that 
for a spherically symmetric state ( S state): 

<0|/(r)z 2 |0> = | <0[/(r)r 2 |0> 

Hence, 


e I E | (0 1 zF 0 j 0) 


<0 1 r" f 0> 




,«+ 1 


(n + 2)! 


(18.69) 


Thus, we finally obtain 



(18.70) 


for the ground state of the hydrogen atom to second order in the applied electric 
field. An alternative method of deriving this result consists of using parabolic co- 
ordinates, in which the Schrodinger equation is separable even in the presence of a 
uniform electric field. 4 The presence of the field causes a lowering of the energy. 
This was expected from the application of formula (18.53) for/i^, since every term 
in the sum may be regarded as a repulsion of the nth level by the kth level. 


4 Bethe and Salpeter (1957), Sections 51-53. 
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Exercise 18.12. Verify that F 0 in (18.68) is a solution of Eq. (18.67) and, 
using F 0 , derive a closed-form expression for the first-order correction, ^^(r), to 
the ground state wave function for a hydrogen atom in a uniform electric field. 

5 . Degenerate Perturbation Theory. We must now extend our perturbation meth- 
ods by admitting the possibility of degeneracy among the unperturbed energies. Two 
or more unperturbed energies may be equal, or nearly equal, usually as a result of 
certain symmetries. Thus, if the system is a particle in a central-force field, the 
magnetic substates of a given angular momentum all have the same energy, owing 
to rotational symmetry. If this symmetry is disturbed, as by the application of a 
magnetic field, the degeneracy is usually removed. 

The perturbation procedure developed in Sections 18.1 through 18.3 cannot be 
applied without modification, because the expansion (18.8) of the perturbed energy 
eigenvector M r „ was based on the assumption that we know into which unperturbed 
eigenvector ' v J r ^ [ 0) collapses as g approaches zero continuously. While symmetry prop- 
erties can frequently be used to avoid this ambiguity, in general we have no prior 
knowledge that would allow us to predict what particular linear combination of the 
given degenerate eigenstates the eigenvector 'if n will converge to as g — > 0. Here 
we set ourselves the task of finding the “correct” unperturbed states on which a 
perturbation expansion can be built. 

The breakdown of the simple Rayleigh-Schrodinger theory in the case of de- 
generate unperturbed states formally appears as the vanishing of some of the energy 
denominators in formulas such as (18.51), (18.52), and (18.53). When this happens, 
the operator K and the corresponding Green’s function are undefined. The pertur- 
bation expansion becomes meaningless, unless, as a result of some symmetry, the 
matrix element of V in the numerator vanishes as well. For practical applications, it 
is important to realize that these difficulties arise not only if the unperturbed eigen- 
states are strictly degenerate, but also if they are merely so close in energy or linked 
by such large matrix elements of the perturbation V that | V kn /(E^ — E £ 0) ) | is large, 
violating condition (18.54) and causing unusually large mixing of unperturbed states 
in (18.52). 

In order to keep the notation uncluttered, we suppose that partitioning the Ham- 
iltonian H into an unperturbed operator H 0 and a perturbation V (with g — 1) has 
produced just two unperturbed energies, E^ and Ef\ which are close to each other 
(by comparison with the magnitude of Vj 2 ). For the special case of strict degeneracy, 
Ef = E ( 2 } . We take the corresponding eigenvectors 1 I r( 1 0) = 1 1) and T f 2 ° ) = [2) to be 
orthonormal. 

The problem of degenerate or nearly degenerate perturbation theory is to replace 
these two basis states by a new set of orthonormal states (the “correct” basis states) 
which may serve as zero-order approximation to the actual perturbed energy eigen- 
vectors. In essence, we attempt a partial diagonalization of the matrix representing 
the total Hamiltonian in the unperturbed eigenvector representation. This is the fa- 
miliar problem of matrix mechanics, except that it arises here in the context of 
matrices that, instead of having infinitely many rows and columns, have the dimen- 
sionality d of the degree of degeneracy of the nth level — two in our example. 

To derive the new “correct” basis vectors appropriate for perturbation theory, 
we construct the projection operator 

P 12 = Pi + P 2 = 1 1><1 1 + 1 2><2 1 


(18.71) 
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which projects any state vector into the two-dimensional subspace of the degenerate 
or nearly degenerate unperturbed eigenvectors. In this subspace, we define the trun- 
cated Hamiltonian 

P 12 HP l2 = £f 1 1)<1| + E?> 1 2)<2 1 + P 12 VP l2 (18.72) 

and then proceed to calculate its eigenvalues island £f exactly. The corresponding 
“correct” orthonormal eigenvectors, |l) and [2), are determined as linear combi- 
nations of the two original basis vectors J 1) and 1 2), so that 

P l2 HP 12 = If |T)<I| + If 1 2)(2 1 (18.73) 

We now define a new unperturbed Hamiltonian, 

H 0 = H 0 - £f 1 1><1 1 - Ef> 1 2><2 1 + If | I)(T | + If 1 2)<2 1 (18>74) 

= Hq — P\ 2 HqPi2 + P i 2 HP 12 = Ho "f Pi 2 VPi 2 
and a new perturbation operator 

V=H-H 0 =V+H 0 -H 0 = V- P 12 VP l2 (18.75) 

Since both H and Hq have zero off-diagonal matrix elements between the two new 
basis states j 1) and |2), and since the diagonal elements (zj H\ i) — (zj/f 0 |z), the matrix 
of the new perturbation V in this two-dimensional subspace is zero. Thus, the of- 
fending terms with vanishing denominators have been eliminated, and a new set, 
£f and E f, of “unperturbed” energy eigenvalues has been found. These differ 
from the original eigenvalues of H 0 by the approximate level shifts, for i = 1,2, 

A Ei « If - £f (18.76) 

The techniques of perturbation theory developed in Section 18.3 can now be applied 
to the Hamiltonian partitioned according to 

H = H 0 + V (18.77) 


The explicit solution of the eigenvalue problem of the auxiliary Hamiltonian 
P\ 2 HP\ 2 defined in (18.72), starts with construction of the matrix of P l2 HP X2 in the 
representation spanned by the original unperturbed energy eigenvectors 1 1) and |2): 


PhHP 12 


( Ef + (1 1 V| 1) <1|V|2> \ 

\ <2|V|1> Ef + {2\V\2)J 


(18.78) 


The eigenvectors 1 1) and 1 2) and the eigenvalues Zsf and £f are determined by the 
set of simultaneous homogeneous linear equations: 


/£f + (ljvji) <1V|2) V(1|0\ = 

\ (2\V\l) Ef + <2|V|2)A<2|i>/ ' \<2|7>/ 


(18.79) 


These equations yield the correct linear combination of the unperturbed zero- 
order energy eigenvectors and the perturbation corrections to the degenerate (or 
nearly degenerate) zero-order energies. The secular equation that must be satisfied 
if (18.79) is to have nontrivial solutions is familiar from Section 8.3, where essen- 
tially the same approximate energy eigenvalues and eigenvectors for degenerate or 
nearly degenerate unperturbed energy levels were derived by applying the varia- 
tional Rayleigh-Ritz method. Equation (18.79) is the same as Eq. (8.36), if we iden- 
tify the root E of the secular equation there withfif here. One of the two roots of 
this secular equations is Ef\ and the other is E 2 \ Explicitly, they are given by 
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formula (8.37), which shows that the corrected initial energies depend generally in 
a complicated way on the unperturbed energies and on the strength of the pertur- 
bation V. 

Summarizing and generalizing: If (i — 1, 2, . . . , d) is a set of d degenerate 
or nearly degenerate unperturbed levels, we construct the d X d matrix of the Ham- 
iltonian H with respect to the d-dimensional manifold of orthonormal unperturbed 
energy eigenstates, |£) 0) ). The d roots of the corresponding secular equation are the 
perturbed energies 1 up to the first order. They are often much more accurate than this 
statement implies, because the interactions between the sets of d states are taken 
into account to all orders. 

Exercise 18.13. Expanding the perturbed energies of two exactly degenerate 
unperturbed energy eigenstates (i = 1, 2) in powers of V as 

Ei = Ef> + . . . = Ef ] + + Ef ] + . . . 

derive the first order corrections Ej l) and show that the degeneracy is removed at 
this level of approximation unless (l|V|l) = <2 1 V| 2) and (l|V|2) = <2 1 V| 1) = 0. 

If the Hamiltonian H( A) of a system depends on a variable real parameter A, 
such as an external field, the energy eigenvalues are functions of A. The theory of 
this section can be used to discuss the conditions under which level crossing as a 
function of A may occur. Let us assume that for some value of the parameter, A = A 0 , 
two energy eigenvalues, 2?i(A 0 ) and E 2 ( A 0 ), are close to each other (and both are far 
from all other energy eigenvalues). Can we, by changing A, cause the energy levels 
Ej( A) to cross? Is there a nearby value of A = at which strict degeneracy occurs? 
Near the unperturbed A = A 0 , we expand the Hamiltonian to first order as 

H( A) = H( A 0 ) + (A - Ao)W'(Ao) ' (18.80) 

We identify an unperturbed Hamiltonian, H( A 0 ) = H 0 , and a perturbation 
(A - A 0 )//'(A 0 ) = V(A), such that V(A) —> 0 as A — > A 0 . For the two perturbed 
energies to be equal at A = A,, the coincidence of the two roots requires that the 
discriminant in (8.37), adapted to the present notation, must vanish: 

[E 2 ( A 0 ) + .(2, A 0 |V|2, A 0 > - ^(Ao) ~<1, A 0 | V|l, A 0 >] 2 

+ 4|U,a o |v|2,Ao>| 2 = 0 

Hence, it is necessary that 

(1, K\V(^)\2, A 0 ) = 0 or (l, \ 0 \H'(X 0 )\2, A 0 > = 0 (18.82) 

Generally, condition (18.82), which requires the vanishing of both the real and imag- 
inary parts of the off-diagonal matrix element, cannot be satisfied by any real value 
A 0 . However, symmetry properties often cause (18.82) to hold identically, indepen- 
dent of the value of A 0 . Then the two energy levels may be made to intersect, but 
otherwise a no-crossing rule applies. This behavior of energies as a function of a 
parameter will be illustrated in Section 18.6. 


Exercise 18.14. If H{ A) depends on a parameter A and if M^A) is its eigen- 
vector that is normalized to unity, prove the Hellmann-Feynman theorem. 


dE„{\) 

dk 



dH( A) 
3A 



(18.83) 
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Show that if condition (18.82) is satisfied, the energy levels cross approximately at 


A] — A 0 


£i(A) - E 2 ( A) 
E[( A) - E[( A) 



^0 


A fuller explanation is in order to clarify the effects of symmetries in pertur- 
bation theory. In general, the perturbation calculation for a J-dimensional manifold 
of degenerate or nearly degenerate unperturbed eigenstates requires us to compute 
the eigenvalues of a d X d matrix. Although it is possible to solve secular equations 
of high dimensionality, we can save labor and gain some physical insight if we take 
full advantage of the symmetry properties of the system. If certain symmetries of 
the unperturbed system survive in the presence of the perturbation, there exist com- 
muting Hermitian operators A, such as generators of the symmetry operations, which 
commute both with H 0 and H, hence also with their difference, the perturbation V: 

[H 0 , A] = [H, A] = [V, A] = 0 (18.84) 

But V need not commute with H 0 . For example, if an electron moves in a central 
field and is perturbed by a uniform external magnetic or electric field that is parallel 
to the z direction, the rotational symmetry about the z axis is preserved. As a con- 
sequence, L z commutes with H 0 , H, and V. ( L x or L y also commutes with H 0 , but not 
with V or H.) If an operator A satisfying (18.84) exists, the eigenvectors of H 0 and 
H may be arranged to be also eigenvectors of A. In constructing the correct linear 
combinations of unperturbed energy eigenvectors, we need to include in the calcu- 
lation only those eigenvectors that belong to the same eigenvalue, A ' of A. Formally, 
the simplification comes about because if the constant of the motion A and the 
perturbation V commute, 

<A'|y|A") = 0 (18.85) 

unless A' — A". Hence, the perturbation matrix has been partially diagonalized and 
the computational burden reduced, often significantly. 

The selection rule embodied in Eq. (18.85) can be* expressed in the language 
of group theory by saying that a perturbation obeying a certain symmetry does not 
mix states, which under the operations of the relevant symmetry group transform 
according to (or “belong” to) different irreducible representations, labeled by the 
eigenvalues of A. 

We conclude this section with some general remarks about the perturbation 
expansion. The Rayleigh-Schrodinger scheme is simple, but its practical applicabil- 
ity is in many cases limited by poor convergence, except when the perturbation is 
quite weak. It has therefore been found to be most useful when the perturbation V 
is produced by an external field whose magnitude can be controlled experimentally. 
For instance, by adjusting an applied electric or magnetic field, we can conduct an 
experiment at a perturbation level sufficiently low to warrant applicability of the 
power series expansion. 

Frequently, however, perturbation methods are called for in the treatment of 
systems with interactions V over which we have no control. For example, the Cou- 
lomb interaction between electrons in an atom or the force between nucleons in a 
nucleus may or may not be weak, but we certainly cannot influence it. Everywhere 
in many-body physics we encounter such inherent perturbations affecting the be- 
havior of the unperturbed constituents. 

Once the restricted i-dimensional eigenvalue problem for a set of degenerate 
or nearly degenerate unperturbed eigenstates has been solved exactly, we can pro- 
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ceed with the Rayleigh-Schrodinger formalism to the desired order. An alternative 
perturbation method is based on treating the right-hand side of 

(E n - H o n„ = VV n (18.86) 

as a known quantity. If the perturbation is assumed to produce a finite level shift, 
A E n = E n — E®\ the homogeneous equation, obtained by setting the right-hand side 
of (18.86) equal to zero, has no nontrivial solution, and we may write (case 2 in 
Section 18.2) » 




n 


1 

E n ~ H 0 




(18.87) 


This equation is projected onto the orthocomplement of the ^-dimensional subspace 
of unperturbed eigenstates corresponding to eigenvalue E , ( , 0) of H 0 by multiplying 
both sides of the equation by 1 — P n . The result is 


= Pn^n + U ~ P„) 





(18.88) 


effecting a separation of the unperturbed eigenstate from a perturbed component that 
is orthogonal to the unperturbed state similar to the procedure used in the Rayleigh- 
Schrodinger form of perturbation theory. In the coordinate representation, (18.88) 
again takes the form of an integral equation. It is important to observe, however, 
that on the right-hand side of (18.88) there is not only the unknown state M/ 1 ,,, but 
also the unknown energy E n (rather than the known unperturbed energy £^ 0) ). Equa- 
tion (18.88) is supplemented by the condition 



0P„, PnWn) 

0F„, P n V n ) 


(18.89) 


which follows from (18.86) and is a generalization of (18.6). Together, (18.88) and 
(18.89) can be used to launch a perturbation expansion by successive iteration. This 
approximation scheme is known as Brillouin-Wigner perturbation theory. 

For the elementary applications of lowest-order perturbation theory to atoms 
in the next section, there is no distinction between the Rayleigh-Schrodinger and 
Brillouin-Wigner schemes. 


Exercise 18.15. By repeated iteration of Eq. (18.88), derive a perturbation 
series for M r „ for “small” V and show that this is formally much simpler than the 
corresponding expansion in the Rayleigh-Schrodinger theory. 

6. Applications to Atoms. We are now in a position to solve a great many realistic 
physical problems, and this section contains a sampling from atomic spectroscopy. 

The theory of the linear Stark effect in the hydrogen atom can be used to illus- 
trate degenerate perturbation theory. The nonrelativistic Hamiltonian is 

n 2 g 2 

H = f + e|E|z (18.90) 

2m r 

where the last term is regarded as a perturbation. Since the spin of the electron is 
irrelevant at this level of approximation and therefore ignored, the ground state of 
hydrogen (IS) is nondegenerate and has even parity. Hence, no linear Stark effect 
occurs, and there is no permanent electric dipole moment. The situation is different 
for the excited states, of which we will treat only the lowest ones. The 2 S state and 
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the three 2P states are degenerate, the former being of even parity, whereas the latter 
three are odd. The perturbation itself is odd and thus can mix S and P states. At first 
sight, degenerate perturbation theory requires that we allow the correct unperturbed 
energy eigenfunctions to be linear combinations of all four unperturbed eigenstates. 
However, H shares cylindrical (or “axial”) symmetry with H 0 , and L z commutes 
with H. Hence, the perturbed eigenstates can still be required to be eigenstates of 
L z and characterized by the eigenvalues mh, where m = — 1 , 0, or + 1 . This quantum 
number classifies the one-dimensional irreducible representations of the rotation 
group in two dimensions, ^(2), appropriate to the cylindrical symmetry of the prob- 
lem. The selection rule for the matrix elements of the perturbation V = e\E\z and 
of H is A m — 0. 

If we denote the several eigenstates of the unperturbed energy level n by \n€m), 
we see that for n = 2 only the states [2, 0, 0) and \2, 1, 0) are mixed by the 
perturbation. The states \2, 1, 1) and 1 2, 1, -1) are single and remain so. They 
exhibit no linear Start effect because they have definite parity. What remains to solve 
is the simple secular equation, 


e | E | (2, 0, 0 1 z 1 2, 0, 0) - A E ± 
e\E\(2, 1, 0|z|2, 0, 0) 


e | E | (2, 0, 0 1 z 1 2, 1,0) 
*|E|<2, l,0|z|2, 1,0) - A E ± 


(18.91) 


where A E is the energy shift between the perturbed and the unperturbed degenerate 
states. Because of conservation of parity, the diagonal elements (2, 0, 0|z|2, 0, 0) 
and (2, 1, 0)z|2, 1, 0) vanish. Hence, the first-order change in energy is 

A E ± = ± e | E || <2, 0, 0 1 z 1 2, 1 , 0) | (18.92) 


and only one matrix element has to be evaluated. For this purpose, it is necessary 
to use the unperturbed energy eigenfunctions explicitly. They are (see Chapter 12): 

3/2 


2 - °> : « - vfe (k ( 2 _ -) exp ( 
2 P(m = 0): 1/i‘S = 77= f ~'| - ex p( “^ l 


■s) 

cos 6 


VAn \2aJ a 

We calculate 

(2, 0, 0 1 z 1 2, 1, 0) - <2, 0, 0|r cos 0|2, 1, 0) 

= h d) i j» r r r< ( 2 ■ 3 -(") c ° s2 9 sin 9 *■ m d * = - 3 ° 

Hence, the linear Stark effect splits the degenerate m — 0 level into two components, 
the shifts being 


A E ± = ±3ae | E j 

The corresponding eigenfunctions are easily seen to be 

~ ~\/2 + 


(18.93) 


(18.94) 


mixing the two different parity components in equal proportions. 

The degeneracy of the 2 S and 2 P states in the hydrogen atom is broken by any 
perturbation of the pure Coulomb field. Thus, the 2s and 2 p levels are no longer 
degenerate in either the lithium atom or the heavy atoms, where they constitute 
components of the inner L-shell. In both cases, there are central screening fields that 
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modify the pure Coulomb field. On the other hand, the degeneracy of the 2€ + 1 
unperturbed magnetic substates survives the addition of such perturbations. But in 
an external field along the z direction, the energy eigenstates are eigenstates of L z 
because of cylindrical symmetry. This fact allows us to avoid the use of degenerate 
perturbation theory, once levels of different € are split appreciably. The strict 2S-2P 
degeneracy in the hydrogen atom also disappears when the electron is treated as a 
relativistic particle with spin and its interaction with the electromagnetic field is 
taken fully into account (Lamb shift). 

Exercise 18.16. Calculate the linear Stark effect for the n = 3 levels of 
hydrogen. 

Next, we will deal more accurately than before with the motion of an electron 
in a central field with the inclusion of spin effects. We saw in Chapter 16 that a 
spin-orbit interaction proportional to the operator L • S must be included in the 
Hamiltonian. Its strength is given by a coefficient 2 W(r)/h about which more will 
be said later. The Schrodinger equation for the spinning electron in an atom is thus 
of the two-component spinor form 

h 2 

V 2 + V(r) + W(r)L • or 

2m 

Actually, Eq. (18.95) is not adequate to yield precise values for the fine structure of 
the hydrogen energy levels because these are affected by relativistic corrections to 
the kinetic operator, — (ft 2 /2m)V 2 , as much as by the specific spin-orbit interaction. 
Indeed, there is no advantage in using a fundamentally nonrelativistic Schrodinger 
equation like (18.95) for the hydrogen atom. This simple system requires and can 
be accorded a much more accurate treatment by the use of the fully relativistic Dirac 
equation of the electron. Such a treatment will be given in Section 24.9. 

Equation (18.95), however, can still be regarded as a useful guide to under- 
standing the electron spin’s general role in effective one-electron systems. For ex- 
ample, an investigation of (18.95) can give us insight into the qualitative features 
of the alkali spectra. An alkali atom may indeed, for many purposes, be regarded as 
a one-electron atom, with all electrons in closed shells contributing merely to an 
effective screening of the electrostatic potential in which the single valence electron 
moves. 

The unperturbed Hamiltonian, 

H 0 = -f- V 2 + V(r) (18.96) 

2m 

represents a familiar central-force problem. We know from Chapter 12 that its ei- 
genfunctions may be assumed also to be eigenfunctions of L z and L 2 . The operator 
H 0 , which contains no reference to the spin variable, commutes with S z and S 2 , and 
it is thus possible to express the unperturbed eigenfunctions of H 0 as 

R ne (r)Y?(6, <p)a and R nt (r)Y?(d, <p)(3 

Here n denotes a radial quantum number that characterizes the unperturbed energy 
eigenvalues, which are in general 2{2t + l)-fold degenerate. Straightforward ap- 
plication of the methods of Section 18.5 would require us first to diagonalize the 
submatrix of the perturbation matrix with respect to these degenerate eigenstates. 
However, a great simplification results if it is recognized that, although neither L 
nor S commutes with the complete Ham i ltonian H — H 0 + W(r) L • or, the total 



= E 



(18.95) 
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angular momentum vector operator, J = L + S, does commute with H, as explained 
in Section 16.4. The simultaneous eigenstates of the mutually commuting coupled 
operators L 2 , S 2 , J 2 , and J z (instead of the uncoupled operators L 2 , L z , S 2 , S z ) make 
up a suitable set of correct unperturbed eigenfunctions, since the perturbation matrix 
is automatically diagonal with respect to them. We have already calculated these 
eigenstates, and they are given explicitly in formula (17.64). However, because of 
the identity 2L ■ S = J 2 — L 2 — S 2 , we do not need to know the detailed form of 
the eigenfunctions if we only wish to evaluate the first-order change in energy due 
to the spin-orbit interaction: 

A E = \ <£< 0) , € \jm\W(r)Q 2 - L 2 - S 2 )|£f, € \jm) 

* (18.97) 

= [/•(; + 1) - m + 1) - lift J o ” \R«e(r)\ 2 W(r)r 2 dr 
or 

AE - {-« + i J /»" w(ry dr for • : t - 1 < i8 - 9s > 

From this formula ( Lande’s interval rule) the so-called fine structure splitting can 
be evaluated if W(r) is known. 

As was pointed out in Section 16.1, a spin-orbit interaction in an atom arises 
from the interaction between the Coulomb field of the nucleus and the intrinsic 
magnetic moment of the orbiting electron. The energy associated with this interac- 
tion is 

tfmagn = ~ X E = S ■ T X ^ (18.99) 

c me 

The potential energy due to the central force is V(r) = — efi. Hence, 

r dV 

eE = -eVfi = 

r dr 

and 


“magn 2 2 j 

m e r dr 


L S 


(18.100) 


When the actual calculation is made with the proper Lorentz transformations for the 
fields, it is found that owing to purely kinematic effects we must add a term to the 
energy, which has the same form as (18.100) but a different coefficient. Known as 
the Thomas term, this contribution to the Hamiltonian is 




Thomas 


1 

— H 

2 ^magn 


( 18 . 101 ) 


Such a term is expected whether or not the potential is of electromagnetic origin, 
and would be present even if the magnetic moment of the electron were zero. 5 The 
total spin-orbit interaction in atoms is the sum of expressions (18.100) and (18.101), 

^spin-orbit = Z 2~2 ~ ~T~ ^ ® (18.102) 

2 m e r dr 


5 .Tackson (1975t. Section 11.8. 



6 Applications to Atoms 


471 


Hence, the splitting of a level of given € and n into two components with j = € + 1/2 
and j = € — 1/2 according to (18.98) and (18.102) is determined by the interaction 
potential 


W(r ) = 


ft 1 JV 
4m 2 c 2 r dr 


(18.103) 


For an attractive ^potential, this quantity is positive; consequently, the level with 
j = € — 1/2 lies below that with j = € + 1/2. 

In order to get an idea of the magnitudes involved, we note that if V(r) were a 
pure Coulomb potential, 

Wr)> = ; h*{?) 

The evaluation of the fine structure splitting A E requires special care for S states 
(€ = 0), because (18.98) becomes indeterminate (0 X co). 6 Since these difficulties 
will be avoided by a proper calculation with the relativistic Dirac equation in Chapter 
24, it is for our present purposes sufficient to estimate for the nth orbit: 

Z 3 _ / Zc 2 mV 

i 3 a 3 \ h 2 n ) 



where a is the Bohr radius of hydrogen. Hence, owing to the Bohr energy formula 

( 12 . 68 ), 



The dimensionless constant, 


e 2 

1 

a ~ he ~ 

137.036 


(18.104) 


is called the fine structure constant. Its small value is responsible for the effective- 
ness of perturbation methods in atomic physics and for the smallness of the fine 
structure splitting, A E (+V2 ~~ A E ( - l/2 , compared with the gross structure energies, 

ET-- 

A£i+i /2 ~ A£ € _ 1/2 ^ (Za) 2 

|£i 0) | " n 

The fine structure constant a is of much more general significance than its 
introduction in the present special context suggests. As a pure number that is in- 
dependent of the units chosen, it has an absolute meaning, measuring the relative 
magnitudes of the Bohr radius, the Compton wavelength, and the classical radius of 
the electron: 


? • » * • 9 — i . . u . . u 

me me me 

If h and c are considered more fundamental than e, the fine structure constant be- 
comes a measure of the electronic charge. This point of view is dominant in quantum 

6 Bethe and Salpeter (1957), Section 13. 
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field theory where a plays a role of the coupling constant between charged matter 
and electromagnetic fields. 

When an atom is exposed to a constant uniform magnetic field, the energy levels 
split further. If we neglect terms that are quadratic in the field strength (which induce 
small effects such as diamagnetism) and choose the z axis conveniently in the di- 
rection of the external field B, the central-force Hamiltonian (18.96) is perturbed by 
an interaction energy 

H' = \ W(r)L -S + ^~ ( g L L z + g s S z ) (18.105) 

n 2 me 

Equation (17.101) has been used for the magnetic moment of the atom, with g L = 1 
and g s = 2 for a one-electron atom. 

If the matrix elements of H' are small compared with the energy-level separa- 
tions of Hq, first-order perturbation theory requires, according to Section 18.5, that 
we solve the eigenvalue problem of H' in a space of degenerate eigenstates of H 0 . 
For a general atom, the zero-order energies are labeled by a set of quantum numbers 
of which only € and s are relevant for the present purpose, because the corresponding 
operators L 2 and S 2 commute with H' . (Unlike an electric field, a magnetic field 
does not mix states of different £.) 

In the (2€ + l)(2s + l)-dimensional vector space characterized by the quantum 
numbers € and 5, the perturbation may be written in the form 

H' = AL • S + nL z + vS z (18.106) 

which, appropriately interpreted, is sufficiently general to render the present discus- 
sion applicable to a large class of problems involving complex as well as one- 
electron atoms, and a variety of other types of angular momentum couplings, ,such 
as hyperfine interactions. The coefficients A, ju, v in (18.106) are real constants. 
Their values are assumed to have been computed separately. 

The eigenvalue problem posed by the effective Hamiltonian H' may be solved 
in any representation, but a judicious choice of basis can save much labor. In order 
to minimize the work of computing matrix elements, it is desirable to use as many 
constants of the motion as possible for specifying the complete set of operators 
whose eigenvectors are to serve as basis. We thus see the advisability of selecting 
L 2 , S 2 , J 2 , J z as the appropriate complete set spanning the coupled representation 
\isjm). Only J 2 fails to commute with H' , and the only nonvanishing off-diagonal 
matrix elements are, therefore, of the form {%sj'm\H'\tsjm). In fact, since H' can 
be rewritten as 

H' =\ (J 2 - L 2 - S 2 ) + pJ z + (v~ p)S z (18.107) 

we only have to evaluate (isj'm\S z \isjm). 

The evaluation of this matrix element of S z is accomplished by using the defin- 
ing relation (17.52) of the C-G coefficients. We obtain 

(£sj'm\S z \isjm) = h 2 m s {m — m s , m s \j'm){m — m s , m s \jm ) (18.108) 

m s 

We now specialize to the case s — 1/2, which includes one-electron atoms. 
Using Eqs. (17.63) in (18.108), we obtain 

TYITL 

(€ i € ± i m\S z \e i € ± i m) = + ^ 


(18.109) 
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for the diagonal matrix elements, and 

<€ i € ± i m\S t \€ M + i w) = ~ 2t + l + ^ ~ ^ ^ 18 - 110 ^ 


for the off-diagonal matrix elements. 

As an important example, we finally consider a 2 P state (€ = 1) with total 
angular momentupi j — 3/2 and j — 1/2. The manifold of substates now is six- 
dimensional. Using the symbolic notation of Eq. (17.71), we write the equivalence 
of the uncoupled and coupled basis in the six-dimensional space as 

3 (x) 2 = 4 © 2 (18.111) 


The operator S z has nonvanishing off-diagonal matrix elements only between 
the basis states | jm) — \\, |) and ||, |) and between the states \jm) = ||, — |) and 
||, — |). The states | jm) = jf, |) and | jm) = ||, — |) are eigenvectors of H', and the 
remaining eigenvectors are of the form 

i) + b ||, |) and c|f, -|) + d\\, ~|> 


The eigenvalues of H' are corrections to the unperturbed energy, which take the 
interactions among the six basis states of the 2 P manifold exactly into account, while 
totally ignoring the interactions involving all other basis states. Denoted by E m , the 
perturbed energies are 


£±3/2 = | h 2 ± (/a + h 


(18.112) 


and the roots of the secular equation 


^-h 2 ±\ (2 jx + v)h - E ± 
2 6 

V2 

— (/a - v)h 


V2 


1/2 


(jjb — v)h 


A h 2 ± - (4/a — v)fi — E± 1/2 
6 


- 0 (18.113) 


There are four roots, two for m = 1/2 and two for m — —1/2. 


Exercise 18.17. Check the eigenvalue Eqs. (18.112) and (18.113). 

For the case of a one-electron atom ( g L = 1, g s = 2), the perturbed 2 P energies 
are plotted in Figure 18.1 as a function of the applied magnetic field of strength B. 
The resulting splitting of spectral lines is known as the Zeeman effect. The operator 
J z commutes with H' for all values of the magnetic field, and m is thus a “ good 
quantum number ’ ’ throughout. In conformity with the no-crossing rule discussed in 
the last section, only levels belonging to different values of m are seen to cross for 
finite B. 


Exercise 18.18. Carry out the calculation of the Zeeman effect in detail. Show 
that, if the external field is either very small or very large compared with the internal 
magnetic field of the atom, the energy depends linearly on B. Show that the internal 
field which produces the fine structure is of the order 1 T or 10 4 gauss. 

7. The Variational Method and Perturbation Theory. In many applications of 
quantum mechanics, the Schrodinger equation cannot be solved rigorously, nor can 
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Figure 18.1. Splitting of a 2 P energy level of hydrogen in a magnetic field B (Zeeman 
effect). The unperturbed energy is £ (0) , and the fine structure splitting is A E. 


a neighboring unperturbed Hamiltonian be found which affords a good approxima- 
tion and a suitable starting point for a perturbation treatment. One then often falls 
back on the equivalence of the eigenvalue problem with a variational principle for 
H. For the standard one-particle Schrodinger equation, this equivalence was dem- 
onstrated and illustrated in Chapter 8. A more general proof was the subject of 
Section 10.3. Here, we repeat a shorthand derivation of this fundamental proposition. 

First, we show that the eigenvectors of H are the vectors for which the variation 
of the expectation value 


(H) 


QP, HV) 
OP, 'P) 


(18.114) 


vanishes. Generalizing the argument in Section 8.1, we have for a small change S' P, 

OP, 'P ) 2 8(H) = OP, 'P^S'P, H'P) + OP, HS'P)] (18.115) 

~0P, tf'P^S'P, ^) + (^, S^)] + 0[(S^) 2 ] 

Since H is Hermitian, the vectors 'P = 'P,, which make 8{H) = 0 to first order in 
S'P must satisfy an equation 


HV n = E n V n 


(18.116) 
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Conversely, every eigenvector of H makes the variation of (H) zero. The stationary 
values of (H) are the eigenvalues E n . 

As an almost trivial application of the variational method, consider a Rayleigh- 
Schrodinger perturbation calculation of an energy eigenvalue E n of a system with 
Hamiltoniann H. Using the notation of Section 18.1 (with the expansion parameter 
g), we observe that the kth approximation to the eigenstate, 

SP H « + ■ ■ ■ + g k ^ (k) (18.117) 

differs from the correct eigenstate by terms of order g k+l . If the approximate 
vector (18.117) is used as a trial vector in (18.114), we see from (18.115) that the 
value of (H) will differ from E n by terms of order g 2k+2 . It follows that ( H ) computed 
with the trial vector (18.117) gives E n correctly to order g 2k+1 . This is a strong and 
useful result. 

I 

Exercise 18.19. Illustrate the connection between the variational and the Ray- 
leigh-Schrodinger perturbation methods by applying a trial vector 

+ gW™ 

to approximate an energy eigenstate of the perturbed Hamiltonian H = H 0 + gV. 
Compare with the results of Exercise 18.6. 

In practice, variational estimates are most easily and reliably made for the 
ground state of a system. Since we cannot extract infinite amounts of energy from 
any physical system, the expectation value of the energy must have a lower bound. 
Hence, if H represents the energy, the absolute minimum of (H) is the lowest energy 
eigenvalue E 0 , and this value is reached if the trial vector used to estimate (H) is an 
eigenvector corresponding to E 0 . 

In principle, the variational method can be used for energy eigenstates other 
than the ground state. We already saw in Section 10.3 how it can be applied to 
eigenstates of successively higher energy by gradually narrowing the vector space 
in which H operates. If the ground state T'q has been obtained, its subspace is split 
off and the variational procedure is repeated in the remaining subspace of all vectors 
that are orthogonal to MV Obviously, the method will give more and more inaccurate 
results if M'o is known only approximately. This difficulty can be circumvented if it 
is possible to construct a subset of trial vectors that are known to be rigorously 
orthogonal to the exact MV For instance, in the case of a particle in a central field, 
where the ground state is an S state, we restrict ourselves to the use of spherically 
symmetric trial functions to estimate E 0 . Similarly, by using R-state wave functions, 
we can obtain an upper limit for the lowest P state, and so on. 

If we know some constant of the motion A, such that 

[H, A] = 0 (18.118) 

the projection operator P A -., which projects any vector into the subspace spanned by 
the eigenvectors of A with eigenvalue A\ is also a constant of the motion. Hence, 

[H,P a[ ] = 0 (18.119) 

From this commutation relation, it follows that if M7„ is an eigenvector of H, P A AV n 
(if it is not zero) must also be an eigenvector of H, corresponding to the same 
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eigenvalue, E„. By limiting ourselves to variations in the subspace characterized by 
A'i, we thus establish that the condition 


* (P A y, HP a V) n 
8 ^ — - = 0 
(Pa?, Pa?) 


(18.120) 


generates the energy eigenvalues and eigenvectors that carry the label A •. In partic- 
ular, the absolute minimum of (H) in the subspace of the vectors P A ? gives us the 
lowest energy eigenvalue with the property A \. 

A generalization of the variational principle can be formulated if we vary not 
only the state vector ''P but also the Hamiltonian H. Denoting such generalized vari- 
ations by the symbol A, and neglecting terms that are quadratic in A, we have by 
the usual rule concerning the variation of a product: 


A (H) = A 


QP, Hty) 
OP, 'P) 


QP, A H'P) QP, HV) 

OP, ^P) OP, 


(18.121) 


where the S-variation is one in which the operator is kept fixed, while *P is varied. 
If the variation is performed near an eigenvector , 'P„, the last term vanishes, and we 
are left with 


A QP„, HV„) _ PP», AHV n ) 

Wn, *n) 


(18.122) 


Conversely, any vector ^P„ for which (18.122) holds is an eigenvector of H. 

So far, the variations of 'P and H have been independent. If we now restrict 
them by requiring that 'P remain an eigenvector of H during the variation, A (H) 
becomes the change A E n of the eigenvalue E n , and we get 


Cgg, Aff'PJ 

0P„, ^n) 


(18.123) 


which, in slightly different notation, is identical with the energy shift (18.9) in first- 
order perturbation theory. 

From (18.122) and (18.123), the variational theorem may be re-expressed in a 
generalized form as follows: 'P is an eigenvector of H if and only if A (H) depends 
only on the change in H, and not on the change in 'P. When condition (18.122) is 
achieved, the value of A (H) is, to first order, equal to the change of the corresponding 
energy eigenvalue. 


Exercise 18.20. Prove the Hellmann-Feynman theorem (18.83), using a vari- 
ational argument. 


Exercise 18.21. Combining the virial theorem and conservation of energy with 
the Hellmann-Feynman theorem, prove that the energy of a bound state for the 
potential V = Ar n scales as A 2 ' {n+2) . Also show that E scales with the particle mass 
as m~ nKn+2) , if the potential is independent of m. Check these results for the Coulomb 
potential, the linear potential, and the harmonic oscillator. 


8. The Helium Atom. The neutral helium atom with a fixed nucleus is described 
by the Schrodinger equation in configuration space: 


h 2 2e 2 

— (V' 2 + V" 2 ) - 

2m r 


2e 2 


+ 


0(r\ r") = Etftr', r") (18.124) 
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The coordinates of the two electrons are labeled r' and r" under the provisional 
assumption that the particles are in principle distinguishable. We know this as- 
sumption to be false but, since electrons have spin one-half with two possible sub- 
states, and to the extent that spin-orbit interactions can be neglected, the fiction of 
two (but no more!) distinguishable electrons can be maintained in solving Eq. 
(18.124). We may think of one electron as having spin “up” and the other spin 
“down,” so that the two particles belong in effect to two different, distinguishable 
electron species. We will see that with this assumption we can obtain the entire 
energy spectrum (but not all the states) of a two-electron system. The corrections to 
this picture, imposed by the identity of the two electrons in the same spin state and 
by the Pauli exclusion principle, will be found in Chapter 22. 

Owing to the symmetry of the differential operator in (18.124), the solutions of 
the Schrodinger equation 

H{ r\ r")tKr\ r") = Eift( r\ r") (18.125) 

fall naturally into two classes: Every solution can be assumed to be either symmetric 
or antisymmetric in the space coordinates: 

r') = ±0(r', r") (18.126) 

If a particular solution /( r', r") fails to have this property, we observe that f(r", r') 
is also a solution — a phenomenon that is referred to as exchange degeneracy, be- 
cause both solutions belong to the same energy eigenvalue. Hence, owing to the 
linearity of the Schrodinger equation, the linear combinations 

<Mr', r") = l - [/(r\ r") ± /( r", r')] 


are also solutions. Since conversely every solution can be written as a superposition 
of solutions with definite exchange symmetry, 

/( r\ r") - i/f+(r', r") + «A_( r', r") 

we lose nothing by confining our attention to the symmetric and antisymmetric so- 
lutions of the two-electron Schrodinger equation (18.124). 

It is tempting to neglect in lowest approximation the repulsive interaction be- 
tween the electrons and treat the neglected part of the potential energy as a pertur- 
bation. Since the repulsive term is clearly of the same order of magnitude as the 
Coulomb attraction due to the nucleus, there is no a priori justification for this 
procedure other than the simplification which results from it, providing a qualitative 
understanding of the level scheme of helium. 

The simplification results because the approximate zero-order Schrodinger 
equation, 

T h 2 2e 2 2e 2 ~\ 

(V' 2 + V" 2 ) - - =- L«V, r") = £ (0) < A ( 0 V, r") 

[_ 2m r r J 

is further separable, if we assume that E (0) — E' + E" and that 

<A ( 0 ) (r\ r") = g(r')h(r") 


The separated equations are both hydrogenic: 




E'gix') 




h{ r") = E"h( r") 
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The lowest unperturbed energy level of the atom (Z = 2) corresponds to 
E' = E" = — Z 2 X 13.6 eV = -4 X 13.6 eV = —54.4 eV. The corresponding 
unperturbed ground state energy eigenfunction is 




Z(r' + r") 
a 


(18.127) 


which is a product of two hydrogenic Is wave functions. This is symmetric under 
exchange. 

The first-order correction to the energy is 


A E = J f !</4 0) (r\ r")P 


r' - r 


5e 2 5 

d 3 r' d 3 r" = — - = - X 13.6 = 34 eV 
4 a 2 


The total ground state energy in first approximation is then 

£ ca i = E (0) + (V(r', r")} = -2 X 54.4 + 34 = -74.8 eV 

The measured ionization potential of helium is 24.46 V. Hence, the total energy of 
the atom is 


E obs = -54.4 - 24.5 = -78.9 eV 

E c& \ > E ohs in agreement with the variational principle. 

The agreement between theory and experiment can be improved by using a 
better trial function than (18.127). The design of suitable trial functions has been 
the subject of ceaseless investigation ever since quantum mechanics was invented. 
The problem offers a challenge to the theoretician because the two-electron system 
is mathematically manageable, although solutions in closed form cannot be found. 
Notable efforts have been made in devising better and better variational trial func- 
tions for the ground state of helium, the test of their quality being how closely (H) 
lies above the precisely measured energy. 

We content ourselves with describing the simplest variational method. It uses 
trial functions of the form (18.127) but with Z replaced by an effective nuclear 
charge, Z eff = Z — <x. This is physically reasonable, because each electron is partially 
screened from exposure to the full charge of the nucleus by the presence of the other 
electron. If cr is left arbitrary, we can calculate the expectation value of the energy 

E(a) = (H) 

where H is the complete Hamiltonian of the helium atom, including the interaction 
between the electrons. According to the variational theorem, E(cr ) > E for any value 
of cr. The best estimate with a trial function of the form (18.127) is therefore the 
lowest upper bound, obtained by minimizing E(o) with respect to a. The result, for 
any two-electron atom with nuclear charge Z, is 



Exercise 18.22. Work out this result. Show that the minimum of E(c r) is ob- 
tained for a — 5/16, suggesting that in the ground state the screening effect from 
one electron amounts to one-third of an electron charge e. Show that the minimum 
value is 



(18.128) 


17 rv h nllii tvi 


m *»''**• on i ninmi/pmpnt is this over the earlier crude estimate? 
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Exercise 18.23. Show that if an unperturbed Hamiltonian 

„ # rv ,, . (2 - 5/16) C j (2 - 5/16)e 2 

H °~ “2^ (V +7 ) 7 7 


(18.129) 


is chosen, the first-order perturbation energy of the ground state of helium vanishes. 


In lowest approximation, the excited states of helium are obtained by lifting 
one electron to an excited hydrogenic level and leaving the other electron in the Is 
level. The corresponding zero-order wave functions are 

>P ( ± = t^= [<Aioo(r')<A„ em{ (r") ± ipmm f (r')ip l00 (r")] (18.130) 

with an energy £ ,(0) = E 0 + E n . The unperturbed energy levels are degenerate exactly 
as in the hydrogen atom, but the interaction between the two electrons removes the 
€ degeneracy. The unperturbed eigenfunctions (18.130) are just the correct linear 
combinations to begin a degenerate perturbation treatment: The perturbation poten- 
tial is diagonal with respect to any two states of the type (18.130), because total 
orbital angular momentum L is a constant of the motion and because two states of 
different exchange symmetry cannot mix. 

The excited states thus split into components according to their symmetry under 
exchange of the particle coordinates. The energy correction due to the symmetric 
perturbation V(r', r") is 

AE ± = J j r\ r")V(r', r")ip ( 2\r’, r") d 3 r' d 3 r" = / ± J (18.131) 
Here the so-called direct integral, 

1= j j </4o(r ')^m e ix")V(v' , r")«Aioo(r')fe m< (r") d 3 r' d 3 r" (18.132) 

is the expectation value of the interaction energy, which would be the entire first- 
order perturbation correction if exchange degeneracy were absent from the unper- 
turbed Hamiltonian, or if if/ l00 and did not overlap. The other term, 

J = J J <A?oo( r ' ) em e ( r ")V(r ' , r")<Anem f (r')«Aioo(r") d 3 r' d 3 r" (18.133) 
bears the descriptive name exchange integral. 

Exercise 18.24. Verify explicitly that 

A E ± = I ± J 

Exercise 18.25. To give an interpretation of the two nonstationary states 
<Aioo(r')</Wm f (r") and (r')^ioo(r") 

show that if the atom is initially in one of these, it shuttles back and forth with an 
exchange frequency J!h. [Compare (8.81).] 


For our repulsive interaction potential, V(r', r") = e 2 /|r' — r"|, the direct in- 
tegral I is positive. Calculation shows that the exchange integral J has a small pos- 
itive value in this case. Hence, all (spatially symmetric) singlet terms lie slightly 
higher than the corresponding (spatially antisymmetric) triplet terms. This behavior 



0 


Chapter 18 Bound-State Perturbation Theory 


understandable, since in the spatially antisymmetric state the probability is small 
r the electrons to be found near each other, while in the spatially symmetric state 
ey have a greater opportunity to repel each other, thereby raising the energy. 

Since, owing to the different spatial symmetries, helium atoms in singlet states 
e physically quite different from those in triplet states, a special terminology is 
ten used. Helium atoms in singlet states are said to form parahelium, whereas 
ose in triplet states form orthohelium. The lowest energy level of parahelium is 
e normal ground state. 


wblems 

L The Hamiltonian of a rigid rotator in a magnetic field perpendicular to the x axis is 
of the form AL 2 + BL Z + CL y , if the term that is quadratic in the field is neglected. 
Obtain the exact energy eigenvalues and eigenfunctions of the Hamiltonian. Then, 
assuming B » C, use second-order perturbation theory to get approximate eigen- 
values and compare these with the exact answers. 

1. A charged particle is constrained to move on a spherical shell in a weak uniform 
electric field. Obtain the energy spectrum to second order in the field strength. 

3. Apply perturbation theory to the elastically coupled harmonic oscillators of Prob- 
lems 6 and 7 in Chapter 15, assuming that the interaction between the two particles 
is weak, and compare with the rigorous solutions. 

4. Use second-order perturbation theory to calculate the change in energy of a linear 
harmonic oscillator when a constant force is added, and compare with the exact 
result. 

5. A slightly anisotropic three-dimensional harmonic oscillator has cd z ~ a> x — a> y . A 
charged particle moves in a field of this oscillator and is at the same time exposed 
to a uniform magnetic filed in the x direction. Assuming that the Zeeman splitting 
is comparable to the splitting produced by the anisotropy, but small compared to 
hoj, calculate to first order the energies of the components of the first excited state. 
Discuss various limiting cases. 

6. Prove that if tp 0 = e~ <Po is a positive bounded function satisfying appropriate bound- 
ary conditions, it represents the ground state for a particle moving in a potential 

V = ~ [(V<p 0 ) 2 - V 2 <p 0 ] + E 0 


and the corresponding energy is E 0 . Verify the theorem for (a) the isotropic harmonic 
oscillator and (b) the hydrogen atom. 

7. Apply the theorem proven in Problem 6 to a particle of mass m in one dimension, 
with the assumed ground state. 


if/ 0 (x) = 


C 

cosh KX 


and a corresponding energy eigenvalue E 0 = -h 2 K 2 /2m. Plot the resulting potential. 

8. Prove that the trace of the direct product of two matrices equals the product of the 
traces of the matrices. Apply this result to show that the “center of gravity” of a 
multiple! split by the spin-orbit interaction is at the position of the unperturbed 
energy level. 
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9. Obtain the relativistic correction °c p 4 to the nonrelativistic kinetic energy of an 
electron, and, using first-order perturbation theory, evaluate the energy shift that it 
produces in the ground state of hydrogen. 

10. Using the Hamiltonian for an atomic electron in a magnetic field, determine, for a 
state of zero angular momentum, the energy change to order B 2 , if the system is in 
a uniform magnetic field represented by the vector potential A = B X r/2. 

Defining the atomic diamagnetic susceptibility x by E = — ^B 2 /2, calculate x 
for a helium htom in the ground state and compare the result with the measured 
value. 


11. Apply second-order perturbation theory to a one-dimensional periodic perturbing 
potential 


+ oo 

V(x) = 2 V n e 2vinx ' s 

n= — oo 


with period g. To enforce closely spaced discrete energies, assume that the entire 
“crystal” has length L = Ng and use the periodic boundary condition (x + LI 2) = 
^ (x — LIT), where N is a large even number. Assume that the zero-order Hamil- 
tonian is that of a free particle subject to these boundary conditions. Show that 
nondegenerate perturbation theory breaks down at the band edges and that the for- 
bidden energy gaps are proportional to the Fourier coefficients of the potential. 

Show that in the limit L —> oo, carried out as in Section 1 of the Appendix, the 
unperturbed (unnormalized) energy eigenfunctions are plane waves, e lkx , and derive 
the second-order approximation to the dispersion function E(k). 


12. Assuming that the zero-order free particle energy is large compared with gig, apply 
the results of Problem 1 1 to the special example of a Kronig-Penney potential with 
attractive 5-functions of strength g. In the middle of the valence band at kg = tt!2, 
verify the result of perturbation theory by expanding the exact eigenvalue condition 
(see Exercise 8.29). 

13. Let if/ be a variational trial function for the ground state i f/ 0 of a system with non- 
degenerate energy eigenvalues. Assume that if/ and if/ 0 are real, normalized to unity, 

and that [ if/if/ 0 d 3 r is positive. Show that 



- if/ o) 2 dr’r < 1 — 



(H) ~ E 0 
Ei ~ E 0 ( 


i/2 


1 (H) - E 0 

2 Ei — E 0 


where E 0 and E x are the exact energies of the ground and first excited states, and 
{ H ) is the expectation value of the Hamiltonian in the state ip. Estimate the accuracy 
of the trial functions used in Section 17.8 for the ground state of the helium atom. 

14. A rotator whose orientation is specified by the angular coordinates 6 and <p performs 
a hindered rotation described by the Hamiltonian 

H = AL 2 + Bh 2 cos 2<p 


with A » B. Calculate the S, P, and D energy levels of this system in first-order 
perturbation theory, and work out the corresponding unperturbed energy eigenfunc- 
tions. 
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Time-Dependent Perturbation Theory 


When the nature of a physical system and its Hamiltonian is known, we 
are often confronted with the problem of predicting the time development 
of the system from its initial condition. The general formalism of 
quantum dynamics was established in Chapter 14 and applied in Chapter 
15 to some problems solvable in closed form. Commonly, however, one 
must resort to perturbation methods. The interaction picture lends itself 
naturally to the formulation of perturbative schemes. While deferring a 
systematic treatment of collisions, which are typical time-dependent 
processes, to Chapter 20, here we study the relatively weak interaction 
between electromagnetic fields and charged-particle systems as illustration 
of time-dependent perturbation theories. In the examples, we usually do 
not go beyond the first nonvanishing order. Fermi’s Golden Rule for 
constant transition rates and its relation to exponential decay emerges as a 
central result. 


1. The Equation of Motion in the Interaction Picture. If H is a time-independent 
Hamiltonian and all solutions of the eigenvalue problem 

H\V n ) = E n \^ n ) (19.1) 

are known, it is straightforward to construct the solution of the equation of motion, 

ih d ^ f) = ff|<P(f)) (19.2) 

dt 

which coincides at t — t 0 with the given initial state |'^ r (f 0 )). Under these circum- 
stances, the state vector 

1^(0) = e~ (M)m ~ to) I ■'F(ro)} = 2 (i 9 .3) 

n 

represents the general solution of the equation of motion, (19.2). For example, in 
Section 13.2 this procedure was applied to the scattering of a wave packet by a fixed 
potential. 

Usually, H is a complicated operator, and rigorous solutions of (19.1) are not 
available except in special cases. Stationary state perturbation theory could be used 
to obtain approximate eigenvalues and eigenstates of H for substitution in (19.3). 
Although this procedure is in principle perfectly suitable if H is time-independent, 
in this chapter we will take up a mathematically simpler approach, which has the 
additional advantage of lending itself to immediate physical interpretation. 

Since our perturbation method will be applied directly to the time -dependent 
equation (19.2), H need not be a time-independent operator. Indeed, perturbations 
that are explicitly dependent on the time are of considerable importance, as illus- 
trated by the forced linear harmonic oscillator. The excitation or ionization of an 
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atom, molecule, or nucleus by a varying electric field is another obvious example, 
and we will work out the cross sections for excitation by a passing charged particle 
and for absorption of light. When H depends on t, there are no strictly stationary 
states, and the energy of the system alone is not conserved. Hence, the simple for- 
mula (19.3) is inapplicable. 

It is characteristic of the physical processes mentioned — scattering of a particle 
by a fixed potential and absorption of energy by a system from a passing distur- 
bance — that the perturbing interaction is limited in space and time. The system is 
unperturbed before it enters the scattering region or before it is hit by a wave, and 
it is again free from the perturbation after sufficient time has elapsed. This sequence 
of events suggests that the total Hamiltonian again be considered as the sum of two 
terms: 

H ~ H 0 + V (19.4) 

where the time-independent operator H 0 describes the unperturbed system, and V is 
the perturbation that may be explicitly time-dependent (as in the case of excitation 
by a transient electromagnetic field) or not (as in the case of potential scattering). 
In either case, the initial state is usually, at least approximately, an eigenstate of H 0 . 
The forced linear harmonic oscillator, for which exact solutions were worked out in 
Section 14.6, may serve to test any approximation method to be developed. 

The basic idea of time-dependent perturbation theory is simple. It assumes 
knowledge of the solutions of the unperturbed eigenvalue problem, 

H 0 \n) = E„\n) (19.5) 

Our objective is to calculate transition amplitudes between the relevant unperturbed 
eigenstates, owing to the presence of the perturbation V. 

The approximations are most readily carried out in the interaction picture. We 
therefore recapitulate the essential formulas from Section 14.2, modifying the no- 
tation slightly for our present purposes. The equation of motion in the Schrodinger 
picture, 


ih = (Ho + 


(19.6) 


is transformed to the interaction picture (indicated as in Chapter 14 by a tilde over 
the state vectors and operators) by a time-dependent unitary operator: 


1^(0) = exp^- H 0 tJ\y(t)) 

In the interaction picture, the equation of motion for the state is 

d\nt)) 


ih 


dt 


= %)l^(0> 


where the new interaction operator is given by 


V(t) = expf^- H 0 tjv expf H 0 t 


(19.7) 


(19.8) 


( 19 . 9 ) 
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n order to derive general perturbation theory expressions, independent of the par- 
icular choice of an initial state, we introduce the time development operator f(t, t 0 ) 
;uch that 

I nt)) = tit , t 0 )\nt 0 )) (19.10) 

The time development operator satisfies the integral equation 

Tit, t 0 ) = 1 - ~ f V(t')f(t', t 0 ) dt' (19.11) 

n Jt 0 

This operator is related to the time development operator, T(t, t 0 ), in the Schrodinger 
picture by 

f(t, t 0 ) = e m)H «T{U to)e~ <mH ^ (19.12) 

We are interested in the transition matrix elements of these time development 
operators between the eigenstates of the unperturbed Hamiltonian. In terms of matrix 
elements, (19.11) becomes 

(k\f(t, t 0 )\s) = S ks - ^ X f exp 7 ( E k - E n )t' (k\V(t')\n)(n\T(t' , t 0 )|s) dt' 

(19.13) 

This exact equation of motion is a suitable starting point for approximations. The 
possibility that V may depend on the time has been explicitly allowed for. Introduc- 
ing the transition frequency by 

hco kn — E k — E n (19.14) 

we obtain from (19.13), 

(19.15) 

We note that the matrix elements of the time development operator defined here 
are simply related to the transition amplitudes defined in Eq. (14.51): 

(k\T(t, to)\s) = ( k , t\T(t, t 0 ) Is, t 0 ) - e-^'iklm t 0 )\ s)e (i/h)£ ^ (19.16) 

The probabilities deduced from these two kinds of amplitudes are obviously the 
same. 

Before proceeding, we finally recall from Section 14.6 a compact, though not 
generally useful, formula for the solution of the equation of motion (19.11): 

T(t, t 0 ) = T exp -j\ V(t') dt' (19.17) 

n Jt 0 

where T stands for the time ordering instruction (14.129). 

The integral equations (19.15) are equivalent to a system of simultaneous linear 
homogeneous differential equations. Omitting for brevity the time arguments of the 
f and V operators, we can write this system in matrix notation as 
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ih 


dt 


/<1|?|1> <1 j T\ 2) <l|f|3> 
<2|fjl) <2 1 T\ 2) <2|f|3) 
(3|?| 1) <3|?|2> (3|f|3) 


\ 


/ <1|V|1> 

s <1 1 V\2)e i( ° l2t 

<i|v|3y tt>i3f . 

■\ 


<2| yj l)^'" 12 ' 

<2 1 V| 2) 

<2 j y| 3)e'" 23 ' . 



(3| Vj l)e - '" 13 ' 

(3\V\2)e- ia>23t 

<3 1 V| 3) . 




• 

• 

■I 

\ 


/<l|f|l> 

<l|f|2> 

<I|f|3) 

<2|f|l>. 

<2|f|2) 

<2|f|3) 

<3|f|l) 

<3|f|2) 

<3|f|3) 

l ■ 


(19.18) 


It is in the solution of this complicated linear system that the perturbation approx- 
imation is invoked. The method takes its name, time-dependent perturbation theory , 
from the fact that it solves directly the time-dependent equations of motion rather 
than the time-independent Schrodinger equation. The perturbation V may or may 
not depend on the time. 


2. The Perturbation Method. The strategy of time-dependent perturbation theory 
is to assume that V is small and proceed with iterating Eq. (19.11). First, T — 1 is 
substituted under the integral; the approximate T so obtained is substituted again in 
the integrand, and so on. The result of this repeated iteration, already carried out 
earlier in Section 14.6 with a slightly different notation, appears as a power series 
in terms of V': 

Tit, t 0 ) = 1 -jf V{t') dt' + f-j) f V(t') dt' f V(t") dt" + ... (19.19) 
n Jt 0 \ n Jt 0 Jt 0 


Approximate solutions of the equation of motion are obtained if this power series 
is arbitrarily terminated. 

Substitution of (19.19) into the integrand of (19.13) produces the desired per- 
turbation expansion of the transition amplitude, 


(k\f(t, t 0 )js> = 


- l - J* e io),at ' {k\Vit')\s) dt' 

+ E f dt' e io>knt '{k\ V{t')\n) f 

\ nf n J‘o Jt 0 


dt" e ia » r (n\V(t")\s ) + ... 

(19.20) 


If k ± s, the transition amplitude in first-order perturbation theory is given by the 
simple and much-used formula, 


(k\T(t, to) | s) = e m)Ek, {k\ T{t, t 0 )\s)e~ m)E ^ = - j (k\Vit')\s)e iw * t ' dt' 

n Jt 0 


(19.21) 


If the initial state of the system is an eigenstate of the unperturbed Hamiltonian H 0 , 
corresponding to the unperturbed energy E s , which is generally not an eigenstate of 
the perturbed Hamiltonian, the time development is not simply oscillatory and the 
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turbation induces transitions between unperturbed energy eigenstates. How this 
d of an initial state is adapted to the real physical conditions that prevail in an 
jeriment will be shown later in several examples and applications. If the pertur- 
:ion is sufficiently small during the period from t 0 to t, the probability for the 
item to be found in a different eigenstate of H 0 with energy E k is 

i r l 2 

P k Ut)= -T (klViO^e^ dt' (19.22) 

n Jto 

necessary condition for this approximate result to be valid is that the time interval 
im t 0 to t be short enough for the transition probabilities to remain small compared 
unity. 

If the perturbation is transient and sufficiently weak in magnitude, the transition 
lplitudes may remain small and the first-order approximation valid throughout all 
aes. In this case, the time development operator T(t, t 0 ) is independent of t 0 and 
provided that t 0 is a time preceding the onset of the perturbation, and t is a time 
ter the perturbation ceases. We may then with impunity take the limits t 0 — » — °c 
d t -» +oo and calculate the transition amplitude from one unperturbed energy 
genstate to another by the Fourier integral, 

-°°)U> = ~T f (k\V{t)\s)e io> ^ dt (19.23) 

n J-» 

le transition probability P k ^. s (+ °°) is thus proportional to the square of the abso- 
te value of the Fourier component of the perturbation matrix element (k[V(0|s) 
'aluated at the transition frequency (o ks . 

s 

Exercise 19.1. From the integral equation (19.15) and from (19.21), derive 
e corresponding linear differential equations for the transition amplitudes, and 
jmpare the perturbation version to the exact equations. 

Exercise 19.2. Apply first-order perturbation theory to a forced linear har- 
Lonic oscillator which is initially in the ground state, and compare the transition 
robability with the exact result in Eq. (14.146). Calculate the energy transfer to the 
scillator exactly and also in perturbation theory. Explain the agreement. Show that 
lassical mechanics gives the same result for an oscillator that is initially at rest! 

Exercise 19.3. Using the first-order formula (19.22) for the transitions from 
state s to states k =£ s, calculate the probability that the system remains in state s. 
how that the result agrees with P J< _ J (f) calculated from (s|f(t, t 0 )\s) only if the 
econd-order terms are retained in the perturbation expansion of this diagonal matrix 
lement. 

If the approximation (19.21) is too drastic to be useful, it can be significantly 
mproved by taking the interactions of a selected number of states more accurately 
nto account. This procedure is analogous to the treatment of the nearly degenerate 
mperturbed state in Section IS. 5. As in all perturbation theories, the chief criterion 
>f success is the consistency of the solution with the simplifying assumptions made 
n arriving at the approximate equations. If a proper choice of closely coupled states 
s made, the perturbation amplitudes may turn out to be sufficiently accurate for 
imes long enough to have physical relevance. 
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If, for example, r and s label two unperturbed energy eigenstates that interact 
strongly with each other but only weakly with all other states k, and if the initial 
state |¥(f 0 )> is a linear superposition of j r) and |s), Eqs. (19.21) are replaced most 
conveniently by a set of first-order perturbation equations in differential form. They 
are simplest when the transition amplitudes are expressed in terms of matrix ele- 
ments of the time development operator in the interaction picture. In the spirit of 
perturbation theory, we neglect on the right-hand side any transition amplitudes that 
link the initial state |' v F(r 0 )) to any unperturbed energy eigenstates other than | r) and 
| s), and write: 

ih j t (k\T(t, 

ih j t < r\T(t , , 0 )j*(a 

ih ^ <s|7(r, t 0 )\fy(t 0 )) 
dt 

An example of the use of these equations will be given in Section 19.5. 


= (kivioiry^nt, t 0 )\nt 0 )) 

+ <fc|y(0|s>e l Ms|m t 0 )\nt 0 )) (k A r, 5) 
= (r\V(t)\r)(r\f(t, t 0 )\V(t 0 )) 

+ <r | V(0 1 s)e ia>rst (s | T(t, t 0 ) \ ^(t 0 )) 

= {s\V(t)\r)e^(r\f(t, r„)|^(t 0 )) 

+ <s|V(0k>(s|f(f, t 0 )\^(t 0 )) 


3. Coulomb Excitation and Sum Rules. As an illustration of first-order time- 
dependent perturbation theory, we now study the effect of a transient Coulomb field 
on a compact bound system of charged particles, which we may call the target. 

As a concrete example, consider the excitation of a nucleus by a particle with 
charge Z x e which sweeps by. When charged particles, e.g., alpha particles of mod- 
erate energy, bombard a nucleus of atomic number Z 2 , the particles are deflected by 
the mutual Coulomb repulsion, and the familiar Rutherford scattering takes place 
(Section 13.8). In the process, the nucleus sees a rapidly varying electric field that 
can cause transitions from the ground state to some excited nuclear state. When this 
happens, we speak of Coulomb excitation in contrast to other inelastic processes 
that occur if the projectile is hadronic and fast enough to allow penetration of the 
Coulomb barrier, so that strong interactions through forces of origin other than elec- 
tromagnetic can occur. 

Here we will consider another example: the excitation and ionization of an atom 
by the passage of a charged particle. This is the dominant mechanism by which 
charged particles are slowed down in matter. 

In lowest approximation, it is frequently permissible to treat the projectile as a 
classical particle moving on a trajectory with a definite initial momentum and impact 
parameter in the field of the target atom. The perturbation interaction is simply of 
the form 


y = 




(19.25) 


where r t , r 2 , . . . , denote the position operators for the Z 2 electrons in the target, 
and R(f) is the position vector of the projectile with charge ~Z x e. It is convenient 
to choose the atomic nucleus as the coordinate origin. 
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If the projectile never enters the target, an electric multipole expansion of the 
rm 

I iF^ <19 ' 26) 

appropriate. Substituting this into (19.23), we obtain the transition amplitude, 

a . + . <‘l | rjr t », ■ R(0) | s) 

-)!.> = 2 J_. ^ - - - * (19.27) 

the distance between the projectile and the target atom is always large compared 
ith the size of the atom, the multipole expansion converges rapidly, and only the 
rst nonvanishing term in the power series in (. in (19.27) need be retained. For a 
iven initial state s and a final state k, there is always a smallest value of €, for 

hich the matrix element (£| ^ r\ P f (r ( - • R(f))|s) will not vanish. Since P 0 — 1, 

i=i 

id |fc) and |s) are orthogonal states, the matrix elements for i = 0 (monopole 
ansitions) are zero. The next term, € = 1, corresponds to electric dipole or El 
ansitions, € = 2 to quadrupole or E2 transitions, and generally we speak of 2^-pole 
r Et transitions. Higher multipole transitions are important in nuclear Coulomb 
xcitation, but in the atomic case electric dipole (€ = 1) transitions strongly dom- 
late, and we limit the discussion to them. 

If the plane of the orbit is taken to be the xz-coordinate plane, we have (< p = 0) 

riPifii ■ R(t)) = r,- • R(r) — x t sin 6(t) + z { cos 6(t) (19.28) 

^here 6(t) is the instantaneous polar angle of the projectile (see Figure 19.1). If we 
ssume that the projectile moves in a central field originating from the target, con- 
ervation of angular momentum may be invoked in the classical description of this 
aotion, and we have 

i 

d6 

[i?(03 2 ~= -bv (19.29) 

dt 


X 



Figure 19.1. The Coulomb field of a charged particle Zj passing near a charged target Z 2 
:auses internal excitation. The impact parameter is b, and the coordinates 0(f) and R(t) 
determine the instantaneous position of the projectile. 
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where v is the initial velocity of the incident particle and b its impact parameter. 
Substituting (19.28) and (19.29) into (19.27), we obtain for the dipole transition 
amplitude 

/Z, e 2 C e(b) 

(&|?(+°°, — °°) I s) ~ + - -- e l<0kst ((k\x\ s) sin 9 + (fc|z|.s) cos 9) dd (19.30) 

TlVb J TT 

where the components of the dipole matrix elements of the target atom are defined 

by 

<*|r|j) = (k\ X r,-|j) (19.31) 

i= 1 

The integration in (19.29) extends to the final scattering angle 9(b), which corre- 
sponds to the prescribed impact parameter, b. Although the integrand in (19.29) 
looks simple, it should be remembered that t is usually a complicated function of 9, 
and the integral can be evaluated only if the classical motion of the projectile is 
known. 

For some purposes, it is a good approximation to assume that the exponent co ks t 
in the integrand of (19.30) is negligible. This approximation is valid if the collision 
time, during which the projectile is close enough to the atom to be effective in 
exciting the latter, is short compared with \/(*) ks , which is a typical period associated 
with the proper oscillations of the atomic electrons. In the limit oy ks t — > 0, (19.30) 
can be integrated easily and yields 

(fc|f(+°°, -oo)| 5 ) = + Ifob {(fc|x|s)[cos 9(b) + 1] - (£|z|s) sin 9(b)} (19.32) 

Exercise 19.4. If the transition amplitude is given by the approximation 
(19.32), show by using closure that the total transition probability from an initial 
spherically symmetric atomic state s is 

^ 4Z?g 4 

2 JW+ 00 ) = ,» 2 2,2 <*k 2 k> cos 2 9(b)/ 2 (19.33) 

k±s U u 

Make an estimate of the magnitude of this transition probability for 1 MeV protons 
incident on hydrogen atoms at an impact parameter b comparable to the Bohr radius. 

The average energy transfer A E in inelastic collisions is defined as 

A £ = 2 (E k - E s )P s ^ k (+ 00 ) = '2(E k - E s )\(k\T(+™, -co)[s>| 2 (19.34) 

k k 

where the sum can be taken over all states of the target atom (since the term k — s 
contributes nothing). When (19.32) is substituted in (19.34), we are typically faced 
with the need to evaluate sums involving dipole matrix elements, such as 

Sy = Ti"Z(E k - ^)[<j|xi|*)<fc|jCy|j) + (s | Xj | k)(k | x,- 1 5 )] (19.35) 

n k 


This can be written as 
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' the unperturbed atomic Hamiltonian H 0 contains the momentum only in the kinetic 
rergy operator, we have 

ih fi 2 

[Xi, H 0 \ = — pi and [[x,-, H 0 \, x y \ = — 8# 
m m 

mce, the useful identity 1 


Sij = 8ij (19.36) 

pplying this relation to a component of the dipole matrix elements of a one-electron 
om (Z 2 = 1), we thus obtain 

TT S - £,)|<*|x|*>| 2 = — 2 <o b \(k\x\s)\ 2 - 1 (19.37) 

n k n ic 


Dr a multi-electron atom with dipole moment (19.31), this identity can be gener- 
ized to the dipole sum rule. 


2 a> fa |<*Wj)| 2 = ^2 

n k 


(19.38) 


he dimensionless quantity 


fb = y «> b |<tk|j>l 2 


(19.39) 


a useful number in spectroscopy. If the transition s — » k were between the two 
west energy levels of a harmonic oscillator, with level spacing h(o ks , we would 
ad fks= ±1, the sign depending on which of the two levels is higher. Therefore, 
:5 is known as the oscillator strength for the transition between s and k with fre- 
lency a> ks , and (19.39) is the Thomas-Reiche-Kuhn sum rule for the oscillator 
rengths: 

i 

E fks = Z 2 (19.40) 

k 


Exercise 19.5. For a hydrogen atom that is initially in the 15 ground state, 
>mpute the oscillator strengths to the lowest three excited states. What fraction of 
e total do these oscillator strengths add up to? (Use the z coordinate, rather than 
to simplify the calculations.) 


The approximate average energy transferred by a fast charged particle in a 
stant (large-impact-parameter) collision to an atom is, according to (19.32), 
9.34), and (19.38), given by 


A E 


2 Z\Z 2 e A 
mv 2 b 2 


cos 2 


m 

2 


hich does not contain Planck’s constant and is essentially the same result as a 
assical calculation would give. The concepts (oscillator strength, dipole sum rules) 


'For more information on atomic sum rules see Bethe and Salpeter (1957), Sections 61-63, and 
sthe and Jackiw (1968), Chapter 11. 
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introduced in this section will find useful application when we turn our attention to 
the interaction of radiation with matter. 


4 . The Atom in a Radiation Field. As a further illustration of the use of (19.23), 
it is instructive to calculate the absorption cross section of an atom for light. The 
electron spin will be neglected, and for simplicity it will be assumed that only one 
atomic electron is involved in the interaction with the incident radiation and that 
the nucleus is infinitely heavy. The results can be generalized to other systems 
(molecules, nuclei) that can absorb radiation. 

The Hamiltonian of the atomic electron (charge —e) in an electromagnetic field 
is given by (4.93) as 

p + - — e<p (19.41) 



In the coordinate representation, this becomes the operator 


fl ^ £ 

H = V 2 - ecf)(r, t) + — A(r, t ) 
2m me 


h 


— [V • A(r, t)\ + [A(r, t)f 


2 me 


2 me 


(19.42) 


We will apply this Hamiltonian only under conditions that allow the A 2 term to be 
neglected. 

It is convenient to separate the electromagnetic field that represents the incident 
light wave from all other contributions to the Hamiltonian, whether or not their 
sources are electromagnetic, such as the Coulomb interaction within the atom. Only 
the effect of the pure radiation will be treated as a perturbation, and all other forces 
that act on the electron are incorporated into the V 0 term of the unperturbed Ham- 
iltonian, 

Ho = ~+V 0 (19.43) 


According to Maxwell’s theory, by choosing a suitable gauge the pure radiation 
field that perturbs the atom can be described in terms of a vector potential A alone. 
Since this field has no sources in the vicinity of the atom, it may in the so-called 
Coulomb gauge be derived from potentials which in Cartesian coordinates satisfy 
the equations 

1 d 2 A 

V 2 A — = 0, <f> = 0, V • A = 0 

With this choice, the complete Hamiltonian simplifies to 

H = + V 0 + — A-p = tfo + — A-p (19.44) 

2m me me 

The last term will be regarded as an external perturbation dependent on r and t. If 
V 0 represents the potential energy of a harmonic oscillator and if the spatial variation 
of A can be neglected, (19.44) is the Hamiltonian of a forced harmonic oscillator, 
which was treated in detail and without approximations in Section 14.6. 
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Exercise 19.6. Prove that the perturbation term in (19.44) is Hermitian. 


It is convenient, and corresponds to actual experimental conditions, to suppose 
at the incident radiation can be described as a very broad plane wave packet mov- 
g in a given direction. Any such wave can be written as a superposition of har- 
onic plane waves. 


/*+ CO 

A(r, t ) = I A(o>) exp 



(19.45) 


here n is the unit vector pointing in the direction of propagation. The vector po- 
ntial must be a real quantity, which requires that 

A*(o>) = A( (d) (19.46) 


n the other hand, the solenoidal character of the vector potential (V ■ A = 0) im- 
ies that the wave is transverse: 


n • A(<u) — 0 


(19.47) 


he perturbation, 


=- f 


exp 


— l(x)\ t — 


n • r 


A(o>) • p do) 


now substituted into (19.23) to give the transition amplitude, 

:|Jt+oo, -oo)| 5 ) 

/* + oo /•+ CO / ^ \ 

— (&|exp( / — n • r JpJ-s-) • A(cn) exp[/(fi> if — (d)t ) dt dto 

Imc J-~ \ c ) 


fimc 

2me 

hmc 

2me 


j (k|exp^i — n • r^p|.y) • A(cu)5(&> — to fo ) dto 


hmc ^l exp ( 1 ' r ) p l s ) ' 


(19.48) 


(19.49) 


he last expression shows that, to this order, the only Fourier component of the 
icident radiation that is effective in the absorption processes leading to the final 
;ate k corresponds to the frequency 


Ei, - E, 


0>ks = 


h 


(19.50) 


r agreement with the frequency condition originally postulated by Bohr. The energy 
ifference ha) ks is given up by the light pulse to the absorbing system, but the change 
rat the radiation field undergoes in this energy transfer can be described only if the 
lectromagnetic field, instead of being treated as an external prescribed driving 
arce, is made part of the dynamical system and thus is itself amenable to quanti- 
ation. The quantum theory of the radiation field — historically, the platform from 
fhich Planck and Einstein launched quantum theory in the first place — will be de- 
eloped in Chapter 23. It should always be remembered that (19.49) holds only if 
re light wave is sufficiently weak so that first-order perturbation theory is valid for 
re entire duration of the pulse and the transition probability is much less than unity, 
anticipating the language of the quantized radiation field, we may interpret (19.49) 
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as the probability amplitude for annihilation and absorption of one photon of energy 
ft Oiks- 

The light pulse also induces inverse transitions if the atom is initially in state 
k. The transition amplitude to the state s of lower energy is given by 

<s|7(+“, -oo) |it) = - l - [ (s\V(t)\k)e i(a * dt (19.51) 


Since V is a Hefmitian perturbation operator (see Exercise 19.6) and a) sk — — a> ks , 
it follows that in first-order perturbation theory 

(s| 7’(+ oo, — oo)|£) = —(k\T(+°°, — oo)|,y)* (19.52) 

showing that in this approximation the two transition probabilities are equal. This 
property is known as the condition of detailed balancing. Since E k > E s , the excess 
energy h(o ks is transferred to the radiation field. A proper quantum treatment of the 
radiation field in Chapter 23 will confirm that this energy appears as light of fre- 
quency (o ks . The transition from k to s, induced by the applied radiation pulse, is 
known as stimulated emission. If the radiating system is near thermal equilibrium, 
stimulated emission between two energy levels is usually much less intense than 
absorption despite the equality of the transition probabilities, because the initial 
population of the upper level is generally smaller. In masers and lasers, however, a 
population inversion is achieved, and stimulated emission predominates. In the lan- 
guage of the quantized radiation field, stimulated emission corresponds to the cre- 
ation of a photon with energy hto ks . 

For actual calculations of transition probabilities using (19.49), we may assume 
the pulse (19.45) to be linearly polarized, which implies that all frequency compo- 
nents A(w) have the same direction independent of <w. No loss of generality is in- 
volved in this assumption, if the cross section for absorption is calculated, since in 
an actual experiment a large number of incoherent pulses of the form (19.45) are 
directed at the system. Incoherence here simply means that the phases of the pulses 
are arbitrary and independent of each other. According to the wave equation, each 
Fourier component is elliptically polarized. Each elliptical oscillation can be decom- 
posed into two linear ones directed along two fixed perpendicular axes, both of 
which are orthogonal to the propagation vector n. Because of the uniform random- 
ness of the phases, the interference terms that arise between the two linear oscilla- 
tions upon squaring the amplitude (19.49) will cancel when an average over many 
pulses is performed. Hence, the total absorption probability is the sum of two ex- 
pressions of the form K&ITX+qq, — 00 )|^)| 2 , obtained by squaring (19.49), one for 
each direction of polarization. 

If we adopt a definite direction of linear polarization e and write 

A(o>) = A(w)e 


the transition probability becomes 


k*-s 


- |<fc|f(+oo, -° 0 )[s>| 2 

4n 6 \A((o ks )\ 2 \{k\exJ i — n • r)p • e\s)\ 2 


(19.53) 


fi 2 m 2 c 2 


This transition probability can be interpreted as the probability for absorption (or 
emission) or a photon, or the mean photon number absorbed by the atom from the 
pulse. 
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This result can also be related to the intensity of the incident pulse by comparing 
vith the Poynting vector, 


N = c 


E X H 


1 dA 


Air Air dt 

(t)A(oj) e exp 


X (V X A) 


J_ f 

■ 1 TC „ - 


47TC J- 

/•+ OO 

X I X e) exp 


— Jfct) t — 


n • r 


dco 


(19.54) 


— i(x>'\ t — 


n • r 


do)' 


e triple product, e X (n X e) = n — e(e • n), simplifies because the wave is 
nsverse, whence e • n = 0, giving the equation 


V = 


1 A 


47TC 


Cj: 


deo dco' o)(o' A(o))A(o)') exp 


— i(o) + o)')\ t — 


n • r 


(19.55) 


follows that the energy flows in the direction of propagation and that the total 
;rgy which the pulse caries through a unit area placed perpendicular to the direc- 
n of propagation is given by 


f- 


N • n dt = f do) o) 2 \A(o))Y 
2c J-oo 


1 rco co 

— - I do) o) 2 \A(o ))\ 2 = N(o)) do) 

c Jo o 


(19.56) 


nee, the energy carried through a unit area per unit frequency interval is 


O) 


N(o)) = — \A(a>) p 
c 

■mbining this result with (19.53), we obtain 


(19.57) 


P k<-.s — 


Air e 


2.2 


\(k\ expl i — A • r p • e [ s) | 2 M«**) 


>2 2 2 1 
n m c(ots 


(19.58) 


Energy transfer from the incident radiation pulse to the atom takes place only 
those frequencies that the system can absorb in accordance with the Bohr fre- 
ency condition (19.50). The average energy loss of the pulse at the frequency o) ks 


ho) ks P k ^ s = — j— | (k|exp(i — n • r )p • e|^)| 2 A(&j fa ) (19.59) 

m o) ks \ c J 

lere a denotes the fine structure constant, a = e 2 /hc ~ 1/137. 

It is customary to base the definition of an absorption cross section for linearly 
larized electromagnetic radiation of a certain frequency on (19.59). [Equivalently, 
! can use (19.58) and think of the absorption cross section as measuring the atom’s 
Dpensity for absorbing photons.] As explained in Section 13.1, the absorption cross 
;tion is a fictitious area which, when placed perpendicular to the incident pulse, 
exposed to and traversed by an amount of radiation energy (or a mean number of 
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photons) equal to that absorbed by the atom or nucleus. This notion of the cross 
section is useful only if it is independent of the detailed shape of the incident pulse. 
Besides the nature and initial condition of the absorbing system, it should depend 
only on the frequency, the direction of incidence, and the state of polarization of 
the incident beam. Under the usual experimental conditions, these goals can be 
achieved and a cross section cr((o) defined, such that the energy absorbed in the 
frequency interval dto is 

A 

a(w)N(w) doj (19.60) 

Absorption cross sections corresponding to two diametrically opposite situa- 
tions will now be evaluated: 

1 . The initial and final states are discrete, and the absorption spectrum consists 
of sharp lines (Section 19.5). 

2. The initial state is again discrete, but the final state is in the continuum. This 
is the photoelectric effect, in which an atomic electron is released and o-(co) is a 
slowly varying function of the frequency (Section 19.6). 

5. The Absorption Cross Section. If the states of s and k are discrete, radiation 
can be absorbed only if its frequency lies very close to oi ks . Hence, cr(<y) is a strongly 
peaked function of a). The total energy absorbed is obtained by integrating the ex- 
pression (19.60), and this quantity can then be equated to the right-hand side of 
(19.59): 

f a((o)N(t o) d(o — — y- a |{fc|exp( i — n • r )p • e|s)j 2 /V(« fa ) (19.61) 

Jaw m o) ks \ c ) 

Here the integral on the left is to be extended over an interval A<w that completely 
contains the spectral line (but no other absorption frequencies). Since or(w) has a 
steep and narrow maximum at a) — co ks , a reasonable definition of the cross section 
can be obtained if N(a)) varies relatively slowly. If we restrict ourselves to pulses 
broad in frequency compared with the line, the left-hand side of (19.61) can be 
written as 


cr((d)N(a > ) d(o = N(o ) ks ) I o~((o) dco 

jAo) JAO) 


and an integrated absorption cross section can be obtained: 


l 


a(co) da> — ^ |(k|exp(i — n • r jp • e|s)|" 

Aw m(i) ks \ c ) 


(19.62) 


Despite its somewhat misleading name, the integrated absorption cross section is 
not a geometric area; it is the product of an area and a frequency and is measured 
in the corresponding units. 

Further progress in the evaluation of this general cross-section formula depends 
on specific assumptions about the structure of the unperturbed Hamiltonian H 0 and 
its eigenstates s and k. The coordinate origin is conveniently chosen at the center of 
mass of the atom. Generally, it is possible to make an important approximation based 
on the fact that the wavelength of the incident light, A = 27rc/(o ks , is large compared 
with the linear dimensions of the absorbing system. In this case, the exponential 
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>erator in the matrix element of (19.62) is advantageously expanded in a power 
ries 

exp^z — n • rj = 1 + i — n • r + ... (19.63) 

d only the first nonvanishing term in the corresponding series of matrix elements, 

n • r^jp • e|s) = (k|p • e|s) + (k\i — n • r p • e|.s)+ . . . (19.64) 

retained. The ratio of two successive nonvanishing terms in this series can be 
timated by assuming that r is effectively of the order of magnitude of the radius 
the system. For an atom of radius a/Z (a: Bohr radius) the expansion parameter 
equal to 

at a ___ Ze 2 _ ^ Z 

cZ he 137 


(k|exp( i — 


d thus the series (19.64) converges rapidly for small Z. 

Selection rules for the wave functions of the unperturbed states determine which 
"m in the expansion (19.64) is the first nonvanishing matrix element. If the first 
:m in the expansion, (k\p • e|s), is different from zero, the transition is said to be 
the electric dipole or El type. Since in this approximation we effectively replace 
p (z <wn • r/c) by unity, the electric dipole approximation also implies a complete 
gleet of retardation across the atom. Owing to the inherent weakness of higher 
ultipolarity (“forbidden” transitions), electric dipole (or “allowed”) transitions 
e the most important ones in atoms. In nuclei the ratio rate is not quite so small, 
ice hco is of the order of h 2 /M n R 2 , where M n is the nucleon mass and R is the 
clear radius. Hence, the expansion parameter is 

to __ h _ Compton wavelength of nucleon 
c M n cR Nuclear radius 

lich for light nuclei can be as large as 1/10. The powerful detection techniques of 
clear physics make it easy to observe transitions of higher multipolarity. When 
ch transitions are considered, it is no longer permissible to neglect the interaction 
the electromagnetic field with the spin magnetic moment. 

In electric dipole transitions, the matrix element (k|p|s) is the decisive quantity 
at must be evaluated. This can be related to the matrix element of the position 
erator r, if the unperturbed Hamiltonian is of the form H 0 — p 2 /2 m + V 0 , and if 
commutes with r. Under these conditions 


[r, H 0 ] = tH q - H 0 r = ih - (19.65) 

m 

;nce, by taking the matrix element of both sides of this equation, 

irt 

<k|pl 5 > = ^ - £t 0) )<k|r[s> = (19.66) 

aking the dipole approximation and substituting (19.66) into (19.62), we obtain 
s integrated absorption cross section, 



da> — Att 2 aa> ks \{k\r ■ e|s)| 2 


(19.67) 
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For many purposes, it is useful to rewrite the transition matrix element as follows: 

4ir i-! 

(k |r ■ e\s) = — £ (-l)^|rn(f)k)Fr 9 (e) (19.68) 

5 q=~\ 

If the absorbing system is a one-electron atom, and if the fine structure of the atomic 
energy levels may be neglected, the quantum numbers ntm and n'Z'm' characterize 
the initial and filial states. Formula (17.74) may then be applied to carry out the 
integration over the angle coordinates in the evaluation of the matrix element, giving 
us 

(n'e'm' |r • e|n€m> = /y — - - ^ <€lm^|€l€'m')(€100|€l€'0)[F?(e)]*< r 

(19.69) 

where 

R n J = j o Rn'AWntirV dr (19.70) 

and R ni {r) is the real normalized radial wave function. The electric dipole selection 
rules 

A€ = €' — € = ±1, A m — m' — m = 0, ±1 

are an immediate consequence of (19.69). The values of the relevant Clebsch-Gordan 
(C-G) coefficients are 

<* , 0 °i"’ °> - JMi- ( ~ u 0} - 

Using this information, we may write 

\(n'{'m'\r ■ e|n£m)| 2 
4 tt 2£ + 1 
= ~3~ 2rVl {nm ’ 

for the last factor in (19.69). 

We may also calculate the average rate of absorption for a system (atom or 
molecule) that is exposed to a continuous frequency distribution of incoherent elec- 
tromagnetic radiation from all directions and with random polarization. These con- 
ditions may, for example, represent the absorption of blackbody radiation. Usually, 
only electric dipole transitions are important, and we restrict our formulas to these, 
although forbidden atomic lines are observed in stellar spectra. 

Applying the dipole approximation to (19.61), we find that the energy absorbed 
from a single polarized unidirectional pulse is 

N((o ks ) [ tr(fi>) da> = 4TT 2 auo ks \(k\r • e ; |.s}| 2 JV,(<o fa ) (19.73) 

where e, denotes the polarization vector of the z'th pulse, and Ni((o ks ) the intensity 
of its frequency component a) ks . Expression (19.73) must be summed over the n 
light pulses which on the average strike the atom per unit time. Since in blackbody 
radiation there is no correlation between e ; and W (&>**), the average of the product 


(19.71) 


(19.72) 
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;he product of the averages; hence, the rate of absorption from the radiation field 
given by 

R = 47rWI^|r-e|^)| 2 N^n (19.74) 

ere the bar denotes averages. The average intensity is related to the energy density 
: unit frequency interval, u{(d), by 

nN(w ) = cu((d) 

e average over the polarization directions e can be performed easily, since for 
rtesian components of e, 

j etejdOt = y 8 V (19.75) 

nee, 

\(k\r • e|s)| 2 = <fc|r|s> • e e • <fc|r|s>* = ^ | <A:|r|^)| 2 (19.76) 


d the average rate of absorption is 


R = — cuo ks c| <fc[ r I I 2 u((D ks ) 


Alternatively, the e-directional average 


(19.77) 
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iy be used to calculate from (19.72) and (19.74) the average rate of absorption by 
tial states of sharp angular momentum. The total transition probability from a 
/en state ntm to all the degenerate m-substates of the level n'€' is obtained by 
mming over the allowed values of m'. Since the C-G coefficients can be shown 
satisfy the relation 


Of' 4 - 1 

2 (€km, m' — m\£k€'m') 2 = — — (19.78) 

m' + 1 

follows from (19.72) that for €' = i + 1, 

2 \{n't'm'\v • e|n€m)| 2 = ± (R n J) 2 (19.79) 

d for €' = € - 1, 

2 \(n'-e'm'\r-t\n€m)\ 2 = ± (R ^' )2 (19-80) 

ibstituting these values in (19.74), we obtain the absorption rates or integrated 
sorption cross sections. Because the Clebsch-Gordan sum (19.78) is independent 
the quantum number m, the average absorption rates do not depend on the par- 
:ular initial magnetic substate. This consequence of rotational symmetry is tradi- 
mally known as the principle of spectroscopic stability. 


Exercise 19.7. For the IS — » 2 P (Lyman a) transitions in the hydrogen atom, 
flbifltp the. total infp.prate.d ahsnrntinn cross section (see Exercise 19.5). 
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A cross section, cr s e (<y), for stimulated emission (s.e.) can be defined in analogy 
with the absorption cross section, cr abs (<y), by equating to N(o))cr s ^(o)) do) the energy 
given off by an atom as a consequence of the passage of the light wave in the 
frequency interval do) around a). Since the transition probabilities for stimulated 
emission and absorption are the same, we see from (19.67) that for any initial 
state s, 

!>absO) - ^ S .e.(")] do) = 47 t 2 J 6>* s |(k|r • e 1 5- j ) 2 - 2 ■ e|£>j 2 

L E k> E S E k< E S 

provided that the sum is taken over all eigenstates of H 0 . From the sum rule (19.37), 
we then deduce for electric dipole transitions in one-electron atoms the general 
formula 

r°° he 2 

l>abs(<y) - tfke.O)] do) = 4 7T 2 a — = 2i t 2 — j c = 27 T 2 r 0 c (19.81) 

Jo 2m me 



where r 0 is the classical electron radius. This relation is independent of the direction 
of polarization and of the particular initial state and holds for any system for which 
the Thomas-Reiche-Kuhn sum rule is valid. 

For an atom with Z electrons, the sum rule (19.38) holds and gives 


-a 

Jo 


[o-absO) - GVe.O)] do) = 2tt 2 Zr 0 c 


(19.82) 


provided that all possible electric dipole transitions of the atom are included. Equa- 
tion (19.82) is identical with the result of a similar classical calculation. 2 

Sum rules for electric dipole cross sections can also be derived for nuclei that 
are exposed to gamma radiation, but care is required because nuclei are composed 
of charged and neutral particles. Since electric dipole absorption comes about as a 
result of the relative displacement of the center of charge of the nucleus and its 
center of mass, the dipole sum rule for nuclei depends critically on the relative 
number of protons and neutrons. 3 

So far only the integrated cross section Jcr(w) day has been discussed; the de- 
tailed frequency dependence of cr((d) was left unspecified. All that could be said 
about it was that i f \has a very pronounced peak at (o = eo ks . In the approximation 
used in this chapter, no absorption occurs unless the exact frequency component o) ks 
is represented in the incident beam. Hence, cr(o) ) must be proportional to a delta 
function and according to (19.61) must be given by 

<7(<o) — — y — |{k|exp(z — n * r Jp • e|s}| 2 8(ft> — (o fa ) (19.83) 

m o) ks \ c ) 

Equation (19.83) is, of course, an approximation. Observed spectral absorption lines 
are not infinitely sharp, but possess a finite width that the simple theory neglects. In 
actual fact, the excited state k is not, as was tacitly assumed, a stable and therefore 
strictly discrete energy level. Rather, it decays with a characteristic lifetime r back 
to the original level s, and possibly to other lower lying levels if such are available. 
Besides, an atomic level is depleted not only by emission of radiation but also, and 
often predominantly, by collisions with other atoms. In a more accurate form of 


2 Jackson (1975), Section 17.8. 

3 Blatt and Weisskopf (1952), pp. 640-643. 
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urbation theory, these interactions, which also cause transitions, must be in- 
led. Thus, the shape of the absorption line with its finite width can be predicted 
r if the subsequent return of the atom to its ground state is taken into account, 
tion 19.8 will show how interactions lead to the exponential decay of excited 
;s and to a more realistic expression for the line shape than (19.83). 

In retrospect, it may be well to stress some tacit assumptions that underlie the 
tment of absorption given in this section. We have assumed that the incident 
ation is weak, so that no atom is ever hit by several light pulses simultaneously; 
ict, we have assumed that the atom could settle down to an undisturbed existence 
comparatively long periods between pulses. A sufficient number of collisions in 
ie intervals ensure that almost certainly the atom is in the ground state when the 
t pulse arrives, or that at least thermal equilibrium has been restored, and each 
n is in a definite unperturbed energy eigenstate. We have also assumed that each 
se is sufficiently weak so that the probability of absorption per single pulse is 
lerically small. If, in violation of these assumptions, the incident radiation were 
/ intense, we would have to use a more accurate form of perturbation theory with 
result that, in the case of such saturation of the absorption line, additional broad- 
lg of the line would be observed. This broadening occurs because the incident 
ation stimulates emissive transitions appreciably, and furthermore there is amea- 
ible depletion of the ground state. The additional broadening of the absorption 
depends on the intensity of the incident radiation. 

Lasers and masers can produce conditions that are totally incompatible with the 
imptions outlined above. The radiation in these devices is coherent, sharply 
rochromatic, and extremely intense, and it is essential for their operation that 
ited atomic states of the active substance are appreciably populated. If, under 
;e circumstances, only two levels r and s of an atomic system, which interacts 
i the radiation, are important, Eqs. (19.24) may be employed with the pertur- 
on 

V(t) = — (e‘“ f A 0 + e~ i<at AS) • p (19.84) 

me 


e, A 0 is a complex-valued constant vector, and the electric dipole approximation 
been made. If selection rules cause the diagonal elements of V in the states r 
.? to vanish and if, furthermore, the off-diagonal matrix element (r| A 0 • pjs) + 0, 
i. (19.24) assume the simple form 


lh dt (cj {ae- il ^- w)t+S] 


ae 


/[(o> r5 — co)t+ 6] 



(19.85) 


;re 


c r = (r\T(t, /o)|m)> and c s = {s\t(t, t 0 )l^o)> 

iote transition amplitudes. The positive constant a is proportional to the strength 
he electromagnetic field and to the magnitude of the dipole matrix element. Since 
plan to apply the solutions of (19.85) near resonance, a> ~ a > rs , in setting up this 
lation we have neglected the rapidly oscillating matrix elements that are propor- 
lal to e ±,( “- + " )f . 


Exercise 19.8. Equation (19.85) is formally the same as the equation of motion 
a spin one-half particle with magnetic moment in a rotating magnetic field. Apply 
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the technique of solving the dynamics of the particle with spin to the problem of a 
two-level atom interacting with a classical monochromatic electromagnetic wave in 
the limit of long wavelength. Derive the analogue of Rabi’s formula, discuss the 
resonance behavior of the transition probability, and examine the dependence of the 
resonance width on the physical parameters. 

Finally, it should be remembered that in this chapter the field is regarded as a 
prescribed external agent, perturbing the quantum system such as an atom but itself 
incapable of being reacted upon. Evidently, this approximation is quite restrictive 
and ignores all effects in the interaction between light and matter which go beyond 
simple absorption and stimulated emission, notably the scattering of light. Even the 
process of spontaneous emission, i.e., emission from an excited atom in the absence 
of all applied electromagnetic fields, cannot be described properly without including 
the field as a dynamical system that can be found in various quantum states of 
excitation, quite similar to the material system with which it interacts. The elements 
of the quantum theory of radiation will be taken up in Chapter 23. 

6. The Photoelectric Effect. In the photoelectric effect, the absorption of light 
by an atom (or nucleus) leads to the emission of an electron (or nucleon) into a 
continuum state. In contradistinction to the sharp-line behavior of cr(w) for jumps 
from discrete to discrete levels, the cross section for the photoelectric effect is a 
smooth function of the frequency. Hence, by comparing (19.59) with (19.60) and 
adapting the notation to the transitions into the continuum of final states, we obtain 

da(o) ) = — ; — | (A: | exp ( i — n • r )p ■ e|s)| 2 ~~ (19.86) 

m~co \ c / Aoj 

Here, to is the frequency of the incident radiation and of the quantum jump. As 
before, s is the discrete initial state, but k, the final state, now lies in the continuum. 
The quantity da(a>) denotes the differential cross section corresponding to emission 
of the electron into a sharply defined solid angle dCl with respect to the linearly 
polarized beam. Finally, An is the number of electron eigenstates in the frequency 
interval A<y, with an average energy E k = ho) + E s and corresponding to the given 
solid angle. 

To exemplify the evaluation of (19.86), the extreme assumption will be made 
that fuo is much larger than the ionizing potential of the atom — so large in fact that 
the final electron states can be satisfactorily approximated by plane waves instead 
of the correct positive-energy atomic electron wave functions (e.g., screened Cou- 
lomb wave functions). If we use plane waves that are normalized in a very large 
cube of length L containing the atom, we have 

1 h Z k 1 

<r|£> = fJE ex P( /k • r )> E k = 

and according to (4.57) the density of electron energy eigenstates, 

An _ dSl CT 3/2 V£fc L 3 
A to 4 tt V27r 2 ^ 2 

If the electron is initially in the K shell, ejection from which is comparatively prob- 
able, the initial wave function is given by (12.98) as 



<r|j> = 
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>stituting all these quantities into (19.86), we obtain for the differential photo- 
ission cross section 


r M 

Kl 


e z k Z 3 
2mch 2 7r 2 (o a 3 



— ik ■ r) exp 



e • j V exp | 



ing to the Hermitian property of the momentum operator, and the transverse 
iracter of the light waves (e • n = 0), we can evaluate the integral: 



ere q is the momentum transfer in units of h, i.e.. 



q 



nee, finally. 


da(a)) 

dVL 


32 e 2 k(e • k) 2 Z 5 
mew a 5 



(19.87) 


dch is in qualitative agreement with experimental observations. 

If h(o is comparable with the ionization potential, a better approximation to the 
al state electron wave function must be used. Ideally, exact continuum eigen- 
lctions of the atomic Hamiltonian could be used which represent the emission of 
electron into a state approaching asymptotically the plane *wavee‘ k r . Such a state 
^k ^r) defined by (13.39). The state i/4 _) (r) with asymptotically incoming waves 
her than t/4 +) (r) with outgoing waves must be chosen, because a wave packet of 
proximate momentum k moving away from the origin is constructed as a super- 
sition of neighboring functions \p^{r), just as an incident wave packet moving 
vard the origin was shown in Chapter 13 to be a simple superposition of t/4 +) (r) 
actions. Since after inclusion of the bound states both t/4 +) (r) and »/4 -) (r) are 
parately complete sets, a nearly plane wave packet moving away from the origin 
uld in principle be expanded in terms of </4 +) (r) as well as j/4 -) (r). In the former 
se, momenta k of all directions would be needed in the superposition in order to 
mic the desired wave packet by judicious destructive interference. In this ap- 
oach, the cross section calculation would be far more complicated than if the 
arply defined set of t/4 ~ ) (r) eigenfunctions are used. 4 

Exercise 19.9. Derive the hydrogenic ground state wave function in momen- 
m space, and show that it can be used in the calculation of the cross section for 
s photoelectric effect in a one-electron atom with nuclear charge Ze. 


4 See Gottfried (1966), Section 58. 
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7. The Golden Rule for Constant Transition Rates. In many cases of practical 
interest, the perturbation V does not itself depend on the time, but the time devel- 
opment of the system with a Hamiltonian H = H 0 + V is nevertheless conveniently 
described in terms of transitions between eigenstates of the unperturbed Hamiltonian 
H 0 . For example, a fixed potential scatters a beam of particles from an initial mo- 
mentum state into various possible final momentum states, thus causing transitions 
between the eigenstates of the free particle Hamiltonian. Scattering will be discussed 
from this point of* view in Chapter 20. 

The theory is again based on the approximate first-order equation (19.21), but 
we now choose t 0 — 0: 

(k\T(t, 0)| 5 ) = -j \ (k\V(t')\s)e i ^ t ' dt' (19.88) 

Tt J 0 

i 

If V does not depend on the time, the integration is trivial. We get for k ^ s, 

(k\ Vis) 

(k\T(t, 0)\s) = f'j -E (1 - e^f (19.89) 

P's 

At t = 0, the system is known to have been in the initial discrete state s. The 
transition probability that at time t it will be in the unperturbed eigenstate k # 5 is 
given by 


n-,«) = |<fe|ra, 0)U>| 2 = 2|<fc|y|s)| 2 - Vr - - 8 ^ 

\T k E s ) 


(19.90) 


This is an oscillating function of t with a period , , , except at or near (o ks = 0. 

*>*» | 

Its derivation from (19.21) presupposes that the time development operator remains 
close to its initial value, unity, in the interval (0, t ). During such times, the proba- 
bility P s ^ s (t) = |{'S'|T(t, 0)|s}| 2 ~ 1, and the transition probabilities to states k f s 
must stay small. We know from the Rayleigh-Schrodinger perturbation theory that 
the eigenstate | s) of H 0 is very nearly an eigenstate also of the complete Hamiltonian 
H if the perturbation is weak enough so that 


(k\V\s) 
E k ~ E s 


« 1 


(19.91) 


This same inequality ensures that the transition probabilities to states with an 
unperturbed energy appreciably different from the initial value can never become 
large. On the other hand, transitions to states with energies that are nearly equal to 
E s can be very important. If such states exist, there may be a good chance of finding 
the system in them after some time has elapsed. Since transitions to states with an 
E k radically different from E s are rare, we may say that if the perturbation is weak, 
the unperturbed energy is nearly conserved. Of course, the total energy is strictly 
conserved, and (H) is a rigorous constant of the motion. 

From (19.88) we see that as long as 


0) ks t\ « 1 


(19.92) 
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insitions to states with E k ~ E s have amplitudes proportional to t, and (19.90) 
comes approximately 

P k Ut) ~ r^\(k\V\s)\ 2 t 2 (19.93) 


le transition probability increases quadratically with time. This circumstance has 
iportant consequences if many states k are available having an energy E k ~ E s , as 
ill happen if H 0 has a near-continuum of eigenstates in the vicinity of E s . 

As a concrete example on which we can fix our ideas, let us consider the helium 
Dm again, regarding the electrostatic repulsion between the electrons as a pertur- 
ition (and neglecting radiative transitions altogether): 


eI + M _ 

2m 2m r l r 2 



(19.94) 


re excited state in which both electrons occupy the hydrogenic 2s level has an 

tperturbed energy of 6 X 13.6 = 81.6 eV above the ground state. Since this energy 

greater than the energy required (54.4 eV) to lift one electron into the continuum 

ith zero kinetic energy, the (2s) 2 level coincides in energy with a continuum state 

■rmed by placing one electron in the bound Is state and the other electron in the 

mtinuum. But the wave functions of these two types of states are very different. 

he first is a discontinuous function of E, and the second is a continuous one. Figure 

1.2 presents an energy level diagram for the unperturbed two-electron system. If 

t = 0 the atom has been excited to the (2s) 2 state, the electrostatic Coulomb 

:pulsion V between the electrons will cause radiationless (autoionizing or Auger) 

ansitions to the tontinuum. Radiative transitions to lower levels with the emission 

f light also take place but are much less probable. The effects of radiationless 

ansitions from autoionizing states, like the (2s) 2 state in the helium atom, are ob- 

jrvable in electron-atom collisions. Auger transitions are observed in heavy atoms 

lat have been excited by the removal of an electron from an inner shell. From such 

ates the emission into the continuum of so-called Auger electrons competes with 

* 

ie emission of X rays. The internal conversion of nuclear gamma rays is another 
sample of radiationless transitions. 

Since the formulas in this section were developed for quadratically integrable 
Lgenstates of H 0 , it is convenient to imagine the entire system to be enclosed in a 
ery large box, or better still, to enforce discreteness of the unperturbed eigenstates 
y imposing on the unbound states periodic boundary conditions in such a large box. 
'n our example, these states are approximately plane waves, although at the lower 
nergies they must be taken to be appropriately screened Coulomb wave functions.) 
l order to be able to replace sums by integrals, the number of these quasi-continuum 
tates per unit energy interval is introduced. This density of final unperturbed states 
i denoted by p f (E). 

According to (19.90), the total transition probability to all final states under 
onsideration, labeled collectively by /, is given by 


2 U«) = 2 f |<kjVk)| 2 1 - C0S ~ y Pf (E k ) dE k (19.95) 

kef J (At As-) 

n usual practice, both |(k|V|s')] 2 and p f (E k ) are sensibly constant over an energy 
ange A E in the neighborhood of E s . In this same interval, as a function of E k the 
actor (1 — cos (o ks t)!{E k — E s ) 2 varies rapidly for all t satisfying 


tAE » h 


(19.96) 
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-108 ev 1 1 

Figure 19.2. Partial energy-level diagram for the unperturbed states of two electrons in 
the Coulomb field of a helium nucleus. The principal quantum numbers for the two 
electrons are n Y and n 2 . The continuous portions of the spectrum are shaded. The doubly 
excited discrete state n, = n 2 - 2 is embedded in the continuum of states, as are all higher 
excited levels. After a radiationless autoionizing transition to the continuum, electron 1 is 
emitted with positive energy and an effective continuous n l value as defined in Eq. 
(13.113), while electron 2 drops to the n 2 = 1 level. 


and has a pronounced peak at E k = E s . This behavior, shown in Figure 19.3, confirms 
that transitions which tend to conserve the unperturbed energy are dominant. The 
interval A E is generally comparable in magnitude to E s itself. If this is so, h/AE is 
a very short time, and there is a considerable range of t such that (19.96) is fulfilled 
and yet the initial state s is not appreciably depleted. During such times, our per- 
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1 - COS 
<W 2 



igure 19.3. Plot of (1 - cos o)t)fco 2 versus w for different values of the time, 

[t — 2 and 4). The width of the peak is inversely proportional to r, and its height 
creases as t 2 , so that the area under the curve is proportional to t. 


Lrbation method is expected to be valid, and (19.95) can be simplified to good 
pproximation as 

2 p ^s(t) = \ \(k\V\s)\ 2 p f (E s ) f " 1 d a, to (19.97) 

is/ n, J « 

/here condition (19.96) has been invoked in extending the limits of the integral to 
: oo . Under the conditions stated, we can evaluate the integral for t > 0 as 


1 - cos (x> ks t 


<4s 


daiks - 



sin x , 
ax = 


IT t 


— oo 


X 


(19.98) 
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Hence, when summed over the set of similar final continuum states, the total tran- 
sition probability increases linearly with time, and we can speak of a constant tran- 
sition probability per unit time or transition rate w, 


™ = P k Ut) = ^ \(k\V\s)\ 2 p f (E s ) 

dt kef n 


(19.99) 


This remarkable result comes about because we have summed over transitions that 
conserve the unperturbed energy strictly (E k — E s ) and transitions that violate this 
energy conservation (E k A E s ). According to (19.93), the former increase quadrat- 
ically with time, but the probability of the latter oscillates periodically. The sum is 
a compromise between these two tendencies: the transition rate is constant. Fermi 
has called formula (19.99) the Golden Rule of time-dependent perturbation theory 
because it plays a fundamental part in many applications. 

In view of its universal relevance, it is useful to give an alternate heuristic 
derivation of the Golden Rule from (19.88). Without further discussion and detailed 
justification of each step, we write a chain of equalities: 


w 


dP k Ut) 

dt 


imt,o)h)p 

at kef 


= T 2 [ (k\V(t')\s)e iw ^ dt' f (k\V(t")\s)* e - iw ^ dt" 

dt kef ft Jo Jo 

da>ks l L dt ' dt " ^lKOk><fc|V(Ok>* e^'-^pfiEk) 

= JtT Pf(Es) j;j> K*l^')l 5 >i 2 ^' - O dt " = y |<£|V(0k>| 2 Pf(E s ) 

(19.100) 


If V is constant in time, the Golden Rule results. 

We illustrate the Golden Rule by applying it to the problem of radiationless 
atomic transitions, caused by the interaction (19.94), from an initial (2s) 2 two- 
electron state (see Figure 19.2). The Golden Rule requires that we evaluate the 
matrix element 


< f\V\i ) = fj 'P*s(r 2 )'l'*s(X i) | r 1 r | <Mri)iA 2s (r 2 ) d 3 r x d 3 r 2 (19.101) 

Here we have labeled the initial and final states by i and /, respectively. In the final 
state, one electron is in the lowest energy state (Is) and the other is in the continuum, 
with wave number k. When the standard multipole expansion 


2 3TT P,( f ■ n 

€=o r> 


= 4 77 X 


1 


r-t + 1 OP 4- 1 ^ 

e=0 >> ZX, -r 1 _ 


S r?*(t i)y?(t 2 ) 


(19.102) 


is substituted in (19.101), it is evident that the matrix element vanishes, unless the 
continuum electron also is in an s state (€ = 0), as anticipated in the notation for 
<Afcr( r i)- If the continuum state is approximated by a plane wave corresponding to 
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rgy E k = h 2 k 2 /2m = 2 E 2s — E ls , the autoionization or Auger transition rate 
omes, with this selection rule, for an atom orion with nuclear charge ze. 


2rre 4 

~~h~ 



e -Zr 2 ,a Sin kr l 1 

kr x r l2> 


X 




g— Zr^/la 



r\ dr i r\ dr 2 


P f (E k ) 


e N denotes the product of the four normalization constants for the wave func- 
is in the matrix element. The product of N 2 and the free particle energy density 
anal states is 


N 2 P f (E k ) = 


mk 

2 n Vh 2 



terms of the Bohr frequency, me 4 /h 3 (=4 X 10 16 sec ^ the calculation of the 
oionization/ Auger rate gives the simple result 

4 

me 

w=C- r (19.103) 

n 

ere C is a numerical constant or order of 10~ 3 . 


Exercise 19.10. Deduce the selection rule for autoionizing transitions from a 
np initial to a Is kt final state. Do the same for transitions from (2 p) 2 to Is ki. 
:ntify the terms in the multipole expansion of the electron-electron Coulomb in- 
action that are responsible for the transitions. 


Exercise 19.11. Show that the approximate autoionization/Auger rate for the 
irogenic (2s) 2 state is independent of Z. Give a physical explanation. Is this con- 
ision changed, if Coulomb wave functions (Section 13.8) are used to describe the 
itinuum state, instead of plane waves? 


Even if the perturbation is weak, there may be good reasons for proceeding 
yond the first perturbation term in evaluating the transition amplitude. If V is 
astant in time and we retain the second term in (19.20), we obtain for t 0 = 0 and 
£ s, 


(k\ T(t, 0)| s) = P Vls i (1 - e^') 


Ev - E , 


+ 2 (*|v|«> 


(m\V\s) ( l ~ e ito ^ 1 - e iw ^ 


(19.104) 


E x - E,„ \ E k - E x 


E k - E„ 


le application of this formula requires care. Nonreversing transitions at constant 
:es to states k which lie in a continuum with E k ~ E s may still be possible in 
;ond order if the matrix elements {m|V|s) = 0 for all states that conserve the 
perturbed energy, E m E s . Under these conditions, the second term inside the 
acket in (19.104) is negligible because it oscillates in time with small amplitude, 
te Golden Rule still holds provided that the first-order transition matrix element 
the perturbation V is replaced by the expression 


(k\V\s) -* 


(fe| V\m){m\ U|s) 


E s E m 


(19.105) 
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This effective transition matrix element is sometimes interpreted by describing the 
transition as a two-step process, in which a virtual transition from the initial state s 
to an intermediate state m is followed by a second virtual transition from the state 
m to the final state k. The (unperturbed) energy of the intermediate state can be 
entirely different from the energy of the initial and final states, but in the overall 
transition from the initial to the final states energy is conserved. 

In the general case, the second term in the bracket in (19.104) cannot be ne- 
glected. The energy denominators may vanish if there are energy-conserving inter- 
mediate states, and, as for the degenerate stationary state perturbation theory dis- 
cussed in the last chapter, a more accurate treatment is required. 5 

A constant transition rate can also occur if the perturbation, instead of being 
constant in time, varies harmonically with frequency ay. 

V(t) = V 0 e~ iwt + Vl e io>t (19.106) 

The interaction (19.84) of a charged particle with an electromagnetic wave is an 
example. Such a harmonic perturbation induces enduring transitions between an 
initial unperturbed state s and a final state k, if frequency matching a ) ks «= at or 
(o sk « a) is achieved. 

If E k > E s , we have the case <o ks ~ at. Substitution of (19.106) into (19.88) 
shows that (19.90) generalizes to 


At) = |<£|?a,0)|.s>| 2 = £5 |<*|V|j)p 


1 — cos(<w fo — Oj)t 
(0) ks - Oif 


(19.107) 


In arriving at this result, rapidly oscillating terms containing e‘ (<0ks+0>)t are neglected 
because they are not effective in causing lasting transitions. 

The same arguments that were employed in deriving the Golden Rule (19.99) 
can be invoked here to calculate the transition rate by evaluating the integral 



1 — cos(<w fa — 0 ))t 
(at ks - Oif 


d{(O ks — <w) = 7 T t 


(19.108) 


This procedure then leads to a generalized Golden Rule, 


277 

W = — \(k\V 0 \s)\ 2 p f (E s + hco) 


(19.109) 


if the perturbation is monochromatic, and the final states form a quasi-continuum 
with a final density of states p f . The transition described here corresponds to ab- 
sorption of energy induced by the harmonic perturbation (19.106). 

As a specific example, we consider the absorption of electromagnetic radiation 
by an atom. The incident field is represented by a plane wave vector potential: 


V(t) = — A • p = — [A 0 e i(kr - Mt) + A$e -i(kr ~“ r) ]e • p (19.110) 
me me J 


which is a limiting case of (19.44) for monochromatic radiation, and thus an infi- 
nitely extended plane wave packet. For such a wave, the time-average incident en- 


s For an example, see the discussion of resonance fluorescence in Sakurai (1967), Section 2-6. 
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gy per unit area and unit time, i.e., the intensity , is found from the Poynting vector 
9.54) to be 

w 2 

I 0 hco = - — |A 0 | 2 (19.111) 

2itc 

here I 0 is the incident flux of quanta (photons) with energy fuo. Since in the tran- 
tion an energy ho) = — E is absorbed, the rate of energy absorption is 

97 re~(t) 

ha) w = ~ ■■ 2 -. 2 - |A 0 | 2 |(k|e'" n ' r/c e • p|5)| 2 pf(£, + hoi) (19.112) 
m c 

the final energy level is narrow, we can write 

p f (E s + h(o) = 8(E k - E s + hw) = h~ l 8(co ks - o) (19.113) 

le absorption cross section cr(co ) was defined in Section 19.4, where the incident 
diation was realistically represented by a finite wave packet. Here we use the 
ealization of infinite plane waves. The cross section is then simply calculated from 
e energy absorption rate, 

I 0 (r((o) — w (19.114) 

ombining Eqs. (19.11 1)— ( 19.1 14), we obtain 

4jr 2 e 2 

<r((o) = rr \(k\e iwn ' r/c p • e|,s>| 2 8(a> ks - to) (19.115) 

(om~cn 

hich is the same result as (19.83). 


Exercise 19.12. Derive the Golden Rule for stimulated emission induced by 
harmonic perturbation (19.106). This case occurs when the final unperturbed en- 
-gy E k lies below the initial energy E s . 


Exponential Decay and Zeno’s Paradox. Suppose that a system, which is 
jrturbed by a constant V as described in the last section 2 is known to be in the 
aperturbed energy eigenstate state s at time t. The probability that the system will 
take a nonreversing transition in the ensuing time interval between t and t + dt is 
pal to w dt, if the conditions under which the Golden Rule (19.99) was derived 
btain and k symbolizes the totality of available final states. Stochastic processes 
ith constant w are familiar in probability theory. The probability P s (t + dt) of 
nding the system in state s at time t + dt can easily be derived if we argue as 
fllows: The system will be in state s at time t + dt only if (a) it was in s at time t, 
ad (b) it has not decayed from this state during the interval dt. Since the probability 
>r not decaying is (1 ~ w dt), we have 


P s (t + dt) = P,(0(1 — w dt) 


(19.116) 


ith the initial condition P(0) — 1. Solving (19.116), we infer the probability of 
nding the system at any time t still undecayed in the initial state: 



(19.117) 


his is the famous exponential decay law. The property (19.116) is implied by the 
lore general relation 

P s(t\ + t 2 ) — P,(fl)P.vfe) 


diich is characteristic of the exponential function. 


(19.118) 
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Caution is required in making the preceding argument because it assumes that 
P s (t) changes only by virtue of transitions out of state s into other states but disre- 
gards the possibility that state s may be replenished by transitions from other states. 
In particular, it assumes that there are no reverse transitions from the final states 
back into the initial state. Even if these assumptions are valid, the probability 
P s (t + dt) can be equated to the product of P s (t) and (1 — w dt) only if the actual 
determination of whether or not at time t the system is in state s does not influence 
its future development. In general, this condition for the validity of (19.116) is 
emphatically not satisfied in quantum mechanics. Usually, such a measurement in- 
terferes with the course of events and alters the chances of finding the system in s 
at time t + dt from what it would have been had we refrained from attempting the. 
measurement. Obviously, an explanation is required. 

If the measurement is performed within a very short time interval, in violation 
of condition (19.96) and before the system has begun to populate the final states 
with a probability increasing linearly in time, the effects on the initial state can be 
dramatic, as we will see later. First, however, we imagine that the time interval dt, 
while very short compared with the depletion time of the initial state, is much longer 
than ft/AE, as demanded by (19.96). 

Starting with 

PM - |(s|?(0, 0)|s)| 2 - 1 

quantum mechanics requires that the probability at all times is to be calculated from 
the amplitudes, which are the primary concepts, hence 

Pft) = | (s | f(t, 0)[s>| 2 and Pft + dt) = |<s|?(r + dt, 0)|s>| 2 

There is in general no reason why these expressions should be related as 

\(s\T(t + dt, 0)U)| 2 = \(s\T(t, 0)|s>| 2 (l - w dt) (19.119) 

as was assumed in writing (19.116). However, it can be shown that the very con- 
ditions which ensure a constant transition rate also establish the validity of Eqs. 
(19.116) and (19.119). 

We can gain a qualitative understanding of the irreversible decay of a discrete 
initial state s embedded in a continuum of final states k with similar unperturbed 
energies, if we recall that the time evolution of the transition amplitudes is governed 
by the coupled linear integral equations (19.15), or their equivalent differential form, 
adapted to the choice t 0 = 0: 

ih 4 <*| Tit, 0) | s> = S e'M* I V\ n)(n \ f(t, 0) 1 5 ) (19.120) 

dt n 

According to these equations, a state n feeds a state k if (&|F|n) # 0. Thus, tran- 
sitions from the initial state s to the various available final states k occur, and at the 
same time these final states contribute to the amplitude of the initial state. As the 
amplitudes of the final states k increase from their initial value, zero, they must grow 
at the expense of the initial state, since probability is conserved and the time de- 
velopment operator T(t, 0) is unitary. We might expect that as the amplitudes of the 
states k increase, they would begin to feed back into state s. Indeed, this is what 
happens, but because of the different frequencies co ks with which this feedback oc- 
curs, the contributions from the many transition amplitudes (k\T(t, 0) |s) to the equa- 
tion of motion for (j.|r(f, 0)|.s) are all of different phases. Hence, if there are many 
states k, forming essentially a continuum, these contributions tend to cancel. It is 
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is destructive interference which causes the gradual and irreversible exponential 
;pletion of the initial state without corresponding regeneration. 

To make these notions more precise and derive the exponential decay law, we 
ust solve the equations of motion under the same assumptions as before (i.e., 
mstant perturbation V, transitions from a discrete initial state s to a quasi-contin- 
lm of final states), but we must remove the uncomfortable restriction to times that 
e short compared with the lifetime of the initial state. In effect, this means that it 
no longer legitimate to replace the transition amplitude (s|f(r, 0)|s) on the 
ght-hand side of (19.120) by its initial value, (s|?(f, 0)|s) ~ 1. However— and this 
the fundamental assumption here — we continue to neglect all other contributions 
the change in (k\T(t, 0) | s) and for t > 0 improve on the first-order approximation 
9.88) by using the equations 

ih 4 (k\T(t, 0)|j> = e ia> ^(k\V\s)(s]T(t, 0)|s) (k + s) (19.121) 
at 

The justification for this assumption is essentially physical and a posteriori. To 
certain extent, it is based on our previous experience with the short-term approx- 
nation. If the perturbation is weak, ( k\f(t , 0)|s) will remain small for those tran- 
tions for which (o ks is appreciably different from zero. Only those transition am- 
litudes ( k\f(t , 0)|s) are likely to be important which conserve unperturbed energy, 
xch that co ks ~ 0. On the other hand, the matrix elements (&| Vj«) that connect two 
ossible final unperturbed states k, n + s for which E k E n , are usually small and 
ill be neglected. (Such transitions are, however, basic in scattering processes; see 
hapter 20.) 

The integral'form of Eq. (19.121) is 

(k\T(t, 0)| f) - -j (k|V|s> f e^’islUt', 0)|s) dt' (k # s) (19.122) 
ft Jo 

he equation of motion for (s|f(r, 0)|s) is, rigorously, 

& 

ih 4 <s|f(t, 0 )|j> = E e^slVimlTit, 0)|s> + <s|y|s><s|f(t, 0)| s) (19.123) 

at k *s 

he term k = s has been omitted from the sum and appears separately on the 
ight-hand side. (In decay problems, we frequently have (jjy|.y) = 0, but in any 
ase this term produces only a shift in the unperturbed energy levels s, as is already 
nown from the Rayleigh-Schrodinger perturbation theory.) 

If (19.122) is substituted into (19.123), we obtain the differential-integral equa- 
ion for the probability amplitude that the system will at time t ^ 0 still dwell in 
tie initial state s : 

j (s\f(t, 0)|s> = -- I^Ms)! 2 f (s\T(t r , 
dt n klks Jo 

- ^<5|y|s)<j|7T(f,0)|s> 

?he solution of this equation demands care. We are interested in times t which imply 
apid oscillations of the factor e ,c ° ks(, '~ t) in the integrand as a function of the final 
tate energy, E k . The slowly varying amplitude {j|T(/, 0)|j) can therefore be re- 


0 )\s)e i< °^ t '~ t) dt' 

(19.124) 
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moved from the t' integrand, and the remaining integral can be evaluated by using 
the formula (A. 19) from the Appendix: 

f e dr = — - — = 776(a)) - zP - ( cot » 1) (19.125) 

Jo zo) + e ft) 

Here P denotes the Cauchy principal value. The resulting differential equation is 
solved for t > 0 by 


(s[ f(t, 0)[s) = exp 


~Y,\(k\v\s)r «(»„>- 1 

fl k=£s fl 


(19.126) 


where we have denoted the perturbative energy shift of the level s, up to second 
order in V, by 


A E s = <j|V|j) + E 

k±s 


\Ws )\ 2 

E s ~ E k 


(19.127) 


Equation (19.126) is the anticipated result, since under the assumptions made in 
deriving the Golden Rule, 

w = 2 l<*l V|*>| 2 5(a>*) - ^ \(k\ V\s)\ 2 p f (E s ) (19.128) 

n k*s n 

Hence, we see that 


(s|f(/, 0 )|j) = exp 




(19.129) 


describing the exponential decay of the unstable state. 

To obtain nonreversing transitions and a progressive depletion of the initial 
state, it is essential that the discrete initial state be coupled to a large number of 
final states with similar energies. However, the fact remains that the exponential 
decay law, for which we have so much empirical support in radioactive decay pro- 
cesses, is not a rigorous consequence of quantum mechanics but the result of some- 
what delicate approximations. 

If (19.129) is substituted back into (19.122), the integration can be carried out 
and we obtain for t > 0, 


(k\f(t,0)\s) = (k\V\s) 


1 — exp 



exp 


- - + A E s - E k )t 


w 

E k ~ (E s + A E s ) + ih - 


(19.130) 


Hence, the probability that the system has decayed into state k is 


P k Ut) = |<*|V|i>| 3 


1_2exp r^v H 


( E S + A E s - E k 
, h 


t 1 + expj 


r 


(E k — E s — A E s f + 


(19.131) 
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ere we have set T = hw. After a time that is very long compared with the lifetime 
\ we obtain the distribution 




kfcivw 

r 2 

(E k - E s - A E s f + — 


(19.132) 


libiting the typical bell-shaped resonance behavior with a peak at E k = E s + A E s 
i a width equal to T. The transition probability to a selected final state k oscillates 
a function of time before it reaches its asymptotic value (19.132). In spite of these 
filiations, the total probability of finding the system in any final state increases 
motonically with time. 

Exercise 19.13. Prove from (19.131) that the sum (integral) ^ \(k\f(t, 0)|s)| 2 

k^s 

er the final states is equal to 1 — exp( — Tt/fT) as required by the conservation of 
ibability. 


In a somewhat imprecise manner, the results of this section can be interpreted 
implying that the interaction causes the state s to change from a strictly stationary 
ite of H 0 with energy E s into a decaying state with (normalized) probability 
E) dE for having an energy between E and E + dE: 



(19.133) 


A remark about the shape of the absorption cross section calculated in Section 
.5 may now be made. If the mechanism responsible for the depletion of an energy 
yel E 2 , after it is excited by absorption of radiation from a stable discrete energy 
vel E lt broadens the excited state in accord with (19.133), the absorption proba- 
lity for the frequency co — ( E — E x )/h must be weighted by the probability 
9.133). The absorption cross section, instead of (19.83), is therefore more accu- 
tely: 


o-(ca) = 


4-7T 2 a 

m 2 (o ks 


[(£[expf i — n • r Ip • e|s )| 2 ~ 


1 


m 


277 (o> — w 2 i) + hi Ah 


(19.134) 


owing the characteristic resonance (or Lorentz) profile of the absorption “line.” 


Exercise 19.14. Show that in the limit T — » 0 the distribution (19.133) be- 
>mes a delta function and the cross section (19.134) approaches the form (19.83). 


We saw that exponential decay of a discrete initial state embedded in a quasi- 
mtinuum of final states presupposes that the system is allowed to evolve without 
i intervening observation of its state at least during a brief time interval hlhE, 
hich is of the order of a typical period associated with the system. On the other 
tnd, if this assumption is violated and the state of the system is observed within a 
uch shorter time interval, we can no longer expect that its time development will 
; unaffected. Here we consider the extreme possibility that a system which is ini- 
ally represented by a state vector ^(0) and which under the action of a time- 
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independent Hamiltonian H evolves into M)(0, in the Schrodinger picture, is sub- 
jected to frequently repeated observations to ascertain whether or not it is still in its 
initial state. In other words, we contemplate a measurement of the projection oper- 
ator, or density operator, p 0 = |^ , '(0)>(^ r (0)|. This operator has eigenvalues 1 and 
0, corresponding to “yes” and “no” as the possible answers to the experimental 
interrogation that is designed to determine if the system has survived in the initial 
state. The observations are assumed to be ideal measurements leaving the system in 
state ^(0) if the answer is “yes,” corresponding to eigenvalue 1. The measurement 
thus resets the clock, and 'P(O) serves again as the initial state of the subsequent 
time evolution. If N such repeated measurements are made in the time interval t, the 
probability that at every one of the interrogations the system is found to have sur- 
vived in its initial state is 


p N (t) = i<w)iwv)>r 

- | (TO) I P(t/N, 0) I TO)XTO) I T(t/N, 0) I TO)> |- N 


(19.135) 


Since we are interested in the limit IV — > °°, it is appropriate to expand the time 
development operator in powers of t/N 



= 1 - 


i t 1 

hN H ~ 



H 2 + ... 


(19.136) 


Substituting this approximation into (19.135), we obtain 


P N (t) = 1 1 + <TO)|H|TO)>VA0 2 - <TO)|H 2 mO)>(t/A0T no 1TO 

= [1 - (A H) 2 0 (t/Nf] N (19.13 /) 


In the limit N — » °° the system is ceaselessly observed to check if it is still repre- 
sented by its initial state. The probability for finding it so is 


lim P N (t ) — lim 

N— > CO N^> CO 


1 - 


e° = 1 


(19.138) 


and we conclude that under these circumstances the system is never observed to 
change at all. This peculiar result is known as Zeno’s paradox in quantum mechanics 
(also as Turing’s paradox). Experiments have shown that it is indeed possible to 
delay the decay of an unstable system by subjecting an excited atomic state to in- 
termittent observations with a high recurrence rate. This result was derived for gen- 
eral dynamical systems, without the use of perturbation theory. 


Exercise 19.15. A spin one-half particle with magnetic moment is exposed to 
a static magnetic field. If the state were continuously observed in an ideal measure- 
ment, show that the polarization vector would not precess. 


Problems 

1. Calculate the cross section for the emission of a photoelectron ejected when linearly 
polarized monochromatic light of frequency is incident on a complex atom. Sim- 
ulate the initial state of the atomic electron by the ground state wave function of an 
isotropic three-dimensional harmonic oscillator and the final state by a plane wave. 
Obtain the angular distribution as a function of the angle of emission and sketch it 
on a polar graph for suitable assumed values of the parameters. 
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Calculate the total cross section for photoemission from the K shell as a function of 
the frequency of the incident light and the frequency of the ^T-shell absorption edge, 
assuming that fuo is much larger than the ionization potential but that nevertheless 
the photon momentum is much less than the momentum of the ejected electron. Use 
a hydrogenic wave function for the K shell and plane waves for the continuum states. 
By considering the double commutator 

[[ H , e ikr ], e~ ikr ] 

obtain as a generalization of the Thomas-Reiche-Kuhn sum rule the formula 

X <£, - E.)|<»| **■'!»> |’ = ^ 

n 2 m 

Specify the conditions on the Hamiltonian H required for the validity of this sum 
rule. 

A charged particle moving in a linear harmonic oscillator potential is exposed to 
electromagnetic radiation. Initially, the particle is in the oscillator ground state. Dis- 
cuss the conditions under which the electric dipole-no retardation approximation is 
good. In this approximation, show that the first-order perturbation value of the inte- 
grated absorption cross section is equal to the sum of dipole absorption cross sections, 
calculated exactly. 

For the system described in Problem 4, derive the selection rules for transitions in 
the electric quadrupole approximation, which correspond to retaining the second term 
in the expansion (19.64). Calculate the absorption rate for quadrupole transitions and 
compare with the rate for dipole transitions. 



CHAPTER 20 


The Formal Theory of Scattering 


It is natural to look at a scattering process as a transition from one 
unperturbed state to another. In the formal theory of scattering, infinite 
plane waves serve as idealizations of very broad and long wave packets, 
replacing the formulation of Chapter 13 in terms of finite wave packets. 
Although the formal theory is patterned after the description of simple 
elastic deflection of particles by a fixed potential, it is capable of 
enormous generalization and applicable to collisions between complex 
systems, inelastic collisions, nuclear reactions, processes involving 
photons (such as the Compton effect or pair production), collisions 
between pions and nucleons, and so forth. Almost every laboratory 
experiment of atomic, nuclear, and particle physics can be described as a 
generalized scattering process with an initial incident state, an interaction 
between the components of the system, and a final scattered state. 1 Formal 
scattering theory enables us to predict the general form of observable 
quantities, such as transition probabilities or cross sections, quickly and 
directly from the symmetry properties of a system, and it is readily 
adapted to a relativistic formulation. The scattering matrix, already 
introduced in Chapters 6 and 16, contains all relevant dynamical 
information. This chapter centers on a generalization of this concept, the 
scattering operator. 


1. The Equations of Motion, the Transition Matrix, the S Matrix, and the 
Cross Section. A collision can be thought of as a transition between two unper- 
turbed states. If the scattering region is of finite extent, the initial and final states 
are simply plane wave eigenstates of definite momentum of the unperturbed Ham- 
iltonian, H 0 = p 2 /2 m, and the scattering potential causes transitions from an initial 
state with propagation vector k to the final states characterized by propagation vec- 
tors k'. 

At first sight, it may seem strange that an incident wave can be represented by 
an infinite plane wave that is equally as intense in front of the scatterer as behind 
it. In Chapter 13, the unphysical appearance of incident waves behind the scatterer 
was avoided by superposing waves of different k and canceling the unwanted portion 
of the wave by destructive interference. The resulting theory, involving wave packets 
and Fourier integrals at every step, was correct but clumsy. In the present chapter, 
precisely the same results will be achieved in a more elegant fashion by the use of 
suitable mathematical limiting procedures. To avoid mistakes, however, it is advis- 
able always to keep in mind the physical picture of the scattering of particles, which 
first impinge upon and subsequently move away from the scatterer. 

In scattering problems, we are interested in calculating transition amplitudes 


‘A comprehensive treatise on scattering processes is Goldberger and Watson (1964). See also 
Newton (1982). A fine textbook is Taylor (1972). 
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itween states that evolve in time under the action of an unperturbed Hamiltonian 
o whose eigenstates are defined by 

H 0 \n)= E„\n) (20.1) 

s in Chapter 13, nonrelativistic elastic scattering of a particle without spin from a 
sed potential will be considered, since this process, to which we refer as simple 
mattering, is the prototype of all more complex processes. The Hamiltonian is 

p 2 

H = H 0 +V=*-+V (20.2) 

2m 

lthough we assume that H 0 is simply the kinetic energy operator, in more sophis- 
cated applications H 0 may include part of the interaction. The interaction operator 
is assumed to be time-independent. 

In the interaction picture, the solution of the equation of motion may be written 
terms of the eigenvectors of H 0 as 

I TO) = 2 \k)(k\T(t, f 0 )|j><s| *(*>)> (20.3) 

k,s 

ccording to (19.15), the equation of motion for the transition amplitudes is ex- 
■essible as 

(k\T(t, f 0 )|i) = - ~ 2 <fc| V|») f e io, ^'(n\f(t', *b)|s> df (20.4) 

n n Jt 0 

here 

h<Okn = E k - E n (20.5) 

id where it has been assumed that the unperturbed eigenvectors are normalized to 
lity: 

(k\n) = 8*. (20.6) 

t 

bus means that, if the unperturbed states are plane waves, periodic boundary con- 
tions must be imposed in a large box of volume L 3 . Eventually, the limit L —> oo 
ay be taken. Typically, in simple scattering of spinless particles, the unperturbed 
ates of interest arc plane wave states e' k '7L 3/2 in the coordinate representation, 
here k stands for the incident or scattered propagation vector, quantized as in 
jetion 4.4 to satisfy the periodic boundary conditions. 

To describe scattering properly, we should choose the initial state |'*I r (i 0 )) in 
0.3) to represent a wave packet moving freely toward the interaction region. Here 
e employ a mental shortcut and idealize the wave packet as an incident plane wave, 
aking the unrealistic assumption that somehow at time t 0 the scattering region has 
sen embedded in a perfect plane wave that is now being released. By pushing the 
itial time into the distant past and letting t 0 — » — oo, we avoid the unphysical 
msequences of this assumption, which brings confusing transients into the calcu- 
tion. Similarly, t — » + oo signals that the scattering process is complete. We thus 
smand that the transition matrix element between an incident, or “in,” state s and 
scattered, or “out,” state k, (k\T(t, i 0 )k). converges to a well-defined limit as 
— oo and t — > + oo . Scattering theory requires us to solve the coupled equations 
0.4) subject to the condition of the existence of the limit 

<k|f(+°°, -c°)\s) 


(20.7) 
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for the matrix elements of the time development operator between unperturbed 
asymptotic in and out states. This matrix is called the scattering matrix, or simply 
the S matrix ; it plays a central role in this chapter. We encountered it in special 
cases earlier, in Chapters 6, 13, and 16. 

To solve the equations for the transition matrix elements, we recall that in first- 
order perturbation theory we would write 

(k\T(t, to)\s) = S ks - l - <*| V|j> £ e^ 1 ' dt[ (20.8) 

Using this expression as a clue, we devise the Ansatz: 

(k\T(t, to)\s) = Tks £ dt' (20.9) 

In generalizing (20.8) to (20.9), the known matrix (&|y|,s) has been replaced by an 
unknown matrix T ks in the expectation that the perturbation approximation might be 
avoided. To make the integral meaningful as t 0 — > — °°, a factor e at has been inserted 
in the integrand with the understanding that a is positive and that the limit a -*• 0 
must be taken after the limit t 0 —■ > — Equation (20.9) will be assumed to give 
(k\T(t, t 0 ) | s) correctly only for times t that satisfy the relation 

\t\ « (1/a) (20.10) 

It is essential to keep these restrictions in mind. If they are disregarded, the equations 
of the formal theory may lead to painful contradictions. Such contradictions easily 
arise because the formal theory is designed to be a shorthand notation in which 
conditions like (20.10) are implied but never spelled out. The formal theory thus 
operates with a set of conventions from which it derives its conciseness and flexi- 
bility. Those who consider the absence of explicit mathematical instructions at every 
step too high a price to pay can always return to the wave packet form of the theory. 
The connection between the two points of view is never lost if it is noted that 1/a 
measures crudely the length of time during which the wave packet strikes, envelops, 
and passes the scattering region. If v is the mean particle velocity, v/a is roughly 
the length of the wave packet. 

Having given some motivation for the form (20.9), we now ask if the T ks , known 
as the transition matrix elements, can be determined so that (20.9) is the solution of 
(20.4). In the next section, we will show that under the conditions prevailing in 
scattering problems such a solution does indeed exist. Moreover, it is rigorous and 
not approximate. 

Assuming the existence of this solution, we can draw an important conclusion 
immediately. Upon integrating (20.9), we obtain 

T e io >ks t+at 

(k\T(t, -oo)| s > = + — * — — (20.11) 

n(-(*>ks + lot) 


as lim e at ° = 0. In the limit t — > +00 and a — > 0, but subject to condition (20.10), 
0 

t 0 -+-oo 

and using Eq. (A. 17) in the Appendix, we obtain: 


S ks = (k\T(+<x,, - 00 ) | s) = 8 ks - 2mS(E k - E s )T ks 


( 20 . 12 ) 


This formula provides an important connection between the S matrix and the tran- 
sition matrix. Kronecker deltas and delta functions are mixed together in this ex- 
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loice of representation for the states labeled by s and k. If that choice is eventually 
ade in favor of the propagation vectors k (the momentum of units of h) and if 
normalization is used as L — * °°, the 8 ks turns into §(k' — k). 

For states k s, we thus have at finite times, during the scattering process 

\m<, ->i«>r - < 20 - 13 > 

ence, for the rate of transition into state k, 

i \(k\T(t, -“)WI 2 = -r^i p |r to | 2 (20.14) 

at (o ks + a ft 

the limit a — » 0, which must always be taken but at finite values of t, this becomes 

s] 2 77 

- 8(a> ks )\T ks \ 2 = — 8(E k - E s )\T ks \ 2 (20.15) 

k f s. The solution thus implies a constant transition rate — precisely what we 
;pect to be the effect of the scatterer causing transitions from state s to k. This 
akes it clear why T ks is called the transition matrix. 

Equation (20.15) is meaningful only if there is a quasi-continuum of unper- 
rbed states with energies E k ~ E s . In Section 20.2, we will demonstrate that the 
atrix T ks exists and that in scattering problems (20.4) has solutions of the form 
0.9). 

If the theory is to be useful, we must establish the connection between the 
ansition rate, (20.15), and the scattering cross section. The unperturbed states are 
)w assumed to b„e normalized momentum eigenstates. Since 

fYl 

8(E k - E k .) = -8(k- k') 


e obtain for the total transition rate from an incident momentum state k into a 
•lid angle dfl. 


» = 2j, Kk’lfa. -»)|k>| 2 = Y da j 0 *<* - *'>l r ‘--‘l 2 *' 2 ^ dk ' 


mkL 3 
(27t) 2 ^ 3 


|T k - k | 2 dCL 


(20.16) 


here k' is the momentum of the scattered particle ( k ' — k). The factor (L/2-7T) 3 is 
e k-space density of free-particle states in the cube of length L, subject to periodic 
mndary conditions [see Eq. (4.53)]. Hence, if v = hk/m is the velocity of the 
cident particles, the transition rate reduces to 


w — 


m 2 vLr 
(: iTTffi 4 


■ k'k I 


dn 


(20.17) 


tie probability of finding a particle in a unit volume of the incident beam is 1/L 3 . 
ence, v/L 3 is the probability that a particle is incident on a unit area perpendicular 
the beam per unit time. If this probability current density is multiplied by the 
fferential cross section da, as defined in Section 13.1, the transition rate w is 
itained; hence, 


da 


w 

v/Lf 



■ k'k I 


dfl 


(20.18) 
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If (&|V|s) is used as an approximation instead of T ks , (20.15) is equivalent to the 
Golden Rule (19.99) of time-dependent perturbation theory, and the cross-section 
formula (20.18) reduces to the Born approximation of Section 13.4. 


2. The Integral Equations of Scattering Theory . We now substitute 
(k\f(t, — «)|j) from (20.11) into (20.4) and immediately set at = 0 in accordance 
with the restriction (20.10). The matrix T ks must then satisfy the system of simul- 
taneous linear equations, 


= <*|v|5> + 

ft n 


(k\V\n)T ns 

-(Dns + ia 


(20.19) 


If the transition matrix elements satisfy this equation, the expression (20.11) is a 
solution of the equation of motion (20.4) for times |r| « 1 la. 

The summation in (20.19) extends over all unperturbed eigenstates. Hence, it 
includes states for which E n — E s . The corresponding denominator, —(o ns + ia, 
vanishes in the limit a —> 0, and we would thus be led to a meaningless equation if 
the spectrum of H 0 were truly discrete. The unmanageable singularity can be averted, 
and the transition matrix can be expected to exist if the unperturbed states form a 
quasi-continuum with energies very close to the energy E s of the initial state. Pre- 
cisely this situation prevails in scattering, where the initial and final states have the 
same, or very nearly the same, energies. The transition amplitudes and the matrix 
elements T ks are then proportional to 1/L 3 . Hence, the amplitudes of the final states, 
k f s, will remain small, at least for t « 1/a. It follows that the initial state is not 
appreciably depleted over a period of the order 1/a, and we can have a constant rate 
of transition. If the initial state, instead of being part of a quasi-continuum of en- 
ergies were a truly discrete state, the method of solution outlined here would fail. 
Such a state decays exponentially and, as was seen in Section 19.8, an altogether 
different solution to the equation of motion is appropriate. 

Equation (20.19) can be used to connect the formal theory with the more explicit 
description of scattering in Chapter 13. For this purpose, it is convenient to define 
a set of vectors ^ +) in Hilbert space by the linear equations, 

T ks = 2 On, Vi +) ) = (¥*, V¥< +) ) (20.20) 


Substituting this scalar product in (20.19), we obtain an equation that ^ +) must 
satisfy: 


on, V^i +) ) = 2 on, V'V n ) 

n 


on, v^[ +> ) 

E s — E n + iha 


+ 


on, v%) 


or, since this must be true for all k 


¥< +) = % + 

n 


on, v^< +) ) 

E s — E n + iha 


= 'n + 2 

n 


1 

E s — H 0 + iha 


'non, v^n +) ) 


( 20 . 21 ) 


However, by completeness. 


2 'non, F'n + o = v'n +) 


( 20 . 22 ) 
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;nce, we obtain as a final result the implicit equation 


¥< +> = + 


— v^< +) 

E s — H 0 + iha 


(20.23) 


lis is a fundamental equation of the formal theory of scattering. The problem of 
Gaining the transition matrix has thus been reduced to solving (20.23), known as 
s Lippmann-Schwinger equation. 

By applying the operator, E s — H 0 + iha to (20.23), it is immediately estab- 
hed that in the limit a 0, 

(£, - tf 0 )¥< + > = W ( s +) (20.24) 

iat is, 1 P' +) becomes an eigenvector of H — H 0 + V, and E s the corresponding 
^envalue. Hence, the procedure of solving the equation of motion outlined in this 
apter can succeed only if, in the limit a -» 0, E s is an eigenvalue of both H 0 and 
In simple scattering systems where V — > 0 as r — » °° , both Hamiltonian operators 
ve the same continuous spectrum: The energy can have any value between 0 and 
, and the presence of the potential does not change this fact. Throughout this 
apter it will be assumed that the continuous portions of the spectra of H 0 and H 
incide and extend from E — 0 to although for more complicated systems this 
a severe restriction that cannot and need not be maintained. In addition, H may 
ssess discrete eigenvalues that have no counterpart in the spectrum of H 0 , corre- 
onding to bound states. 

Actually, the periodic boundary conditions on the cube of length L cause the 
tire spectra of both H 0 and H to be discrete. Even the quasi-continuous energy 
/els of H 0 and H will then generally not coincide, because the two operators differ 
V. But the shift, A E k , of each level goes to zero as L —> oo. The limit L —> °° 
jst be taken before the limit a — > 0, since via signifies approximately the length 
the incident wave packet and we must require that L > via, or else the normal - 
ition box could not contain the wave packet. 2 If the limit f — » a> is accompanied 
a change of normalization of positive energy states from unity to k-normalization, 
defined in Section 4.4, the formal identification L = 2tt allows the same equations 
be used after the limit has been taken as before. 

In retrospect, it is instructive to rederive (20.23) by making the eigenvalue 
uation (20.24) the starting point, and to ask for a solution of the eigenvalue prob- 
n of H, assuming that the fixed eigenvalue E s lies in the continuous spectrum that 
common to both H and H 0 . Equation (20.24) may then be regarded as an inho- 
ageneous equation to be solved. Its homogeneous counterpart, 

(H 0 - E s )% = (H 0 - E s )\s) = 0 (20.25) 

es have solutions (by assumption). According to the theorems of Section 18.2 the 
lomogeneous equation has a solution if and only if the inhomogeneous term 
P^ +) is orthogonal to the subspace spanned by the solution of the homogeneous 
uation (20.25). This condition is generally not satisfied. We can nevertheless ig- 
re this apparent obstacle in solving (20.24), because, when the spectrum is quasi- 


2 In the formal scattering theory, the order in which the several limiting processes are executed is 
■y important. No simple mathematical notation has yet been invented for operators that depend on 
ameters whose limiting values are to be taken after the operation. 
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continuous, the component of V A J^ +) in the troublesome subspace can be neglected 
without any appreciable alteration. 

According to Section 18.2, the general “solution” of the inhomogeneous equa- 
tion (20.24) is 


^< +) = % + G + (E s )VW ( s +) (20.26) 

where G+(E S ) is $ particular solution of the operator equation 

(£, - H 0 )G(E S ) = 1 - P s (20.27) 

This equation has infinitely many different solutions. In order to obtain the solutions 
M f i +) we must obviously make the choice 


G + {E S ) 


1 

E s — H 0 + iha 


(20.28) 


since then (20.23) and (20.26) become identical. The operator (20.28) is indeed a 
particular solution of (20.27), since ( E s — H 0 )G+(E S ) gives zero when it acts on an 
eigenvector of H 0 with eigenvalue E s and, in the limit a — » 0, has the effect of the 
identity operation when it acts on an eigenvector T r r of H 0 with E r ±E s . 

The implicit equation (20.23) for T r ^ +) looks like the integral equation (13.28), 
and the two equations are equivalent if the unperturbed eigenstates M*,. are momentum 
eigenstates. To prove this, it is best to go back to (20.21) and write it in the coor- 
dinate representation. If we set 


<r|TA +) > = <Ak +) ( r ) (20.29) 

and 


(r\%) = 


,ik-r 


(277) 


3/2 


we obtain for the local potential (r'| V|r") = V(r)8(r' - r"), 


(/2 +) (r) = — - — e ikr — f — 

^ 1 1 (2tt) 3/2 (27 r ) 3 J k' 7 


jk'(r-r') 


(k 2 + is) 


2m 

d 3 k’ -r V(r') </4 +) (r') d 3 r' 

ft 


(20.30) 


By comparison with the equations of Section 13.3, we see that after the k '-integra- 
tion is performed, this equation becomes identical to the integral equation (13.39). 

The preceding discussion shows that the Green’s function G + (r, r') is propor- 
tional to the matrix element of the operator 


G + (E) = (E - H 0 + ihay 1 


in the coordinate representation: 


G + (r, r') = -4^<r|G + (£)|r'> 


(20.31) 


As shown in Chapter 13, this particular Green’s function ensures that the solutions 
</r (+) of the integral equation (13.28) asymptotically represent outgoing spherical 
waves in addition to incident plane waves. 
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Exercise 20.1. Work out the Green’s function ( x j G+(E ) |x') for a free particle 
one dimension. 


le operator G+ is only one particular solution of (20.24). The particular solution 


G}E S ) 


1 

E s — H 0 — iha 


(20.32) 


' (20.27) leads to the replacement of (20.26) by the equation 


¥<“> = % + G}E S )V ’¥< _) (20.33) 

M', is a momentum eigenvector, the eigenstate 4 f £ _) of H, when projected into the 
>ordinate representation, describes asymptotically spherically incoming waves. 
Another solution of (20.27) which is often useful is the operator 


GfEJ = i G + (E S ) + ^ G-(E S ) = X - 


1 


+ 


1 


H 0 + iha E s — H 0 — iha. 


he expression on the right may be simplified and written as [Eq. (A. 20) in the 
ppendix]: 


G\(E) = P 




(20.34) 


/ith this operator, we can define a set of eigenstates 4* (1) by the equation 


¥ (1) = 4 ^ + 



(20.35) 


l the coordinate representation, the operator G X (E) is related to the standing wave 
rreen’s function (13.38). 

The solution of equations like (20.26), (20.33), and (20.35) depends on the 
tioice of the free particle states 4V To guard against confusion with commonly 
sed terminology, we note that, although for simple scattering the states j4^ +) ) in 
le coordinate representation represent asymptotically outgoing spherically scattered 
raves, they correspond to the “in” states of particles with momentum hk, before 
tey reach the scattering region. Similarly, the states represent incoming 

pherically scattered waves, but correspond to the “out” states of particles with 
lomentum hk ' , after the scattering has taken place. 

If we identify the unperturbed initial and final states as the momentum eigen- 
tates, |k) and | k '), and make the proper adjustment for box normalization, the 
•ansition matrix can, by (20.20), be expressed as 

T k k = (4V, m +) ) - | r V(r)«AL +) (r) d 3 r (20.36) 


Comparing this with (13.43), we find a simple relation between the transition matrix 
nd the scattering amplitude : 


[ k'k 


Irrh 2 

ml} 


A(k') 


(20.37) 
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In the limit L —> °o, and with k- normalization, this equation turns into 

-(2tt) 2 mT k , k = h 2 f k ( k') 

Substituting T k - k from (20.37) into (20.18), we obtain 

d(T= l/ k (k ')[ 2 dfl 

4 

This result is identical with (13.23). 


(20.38) 


(20.39) 


3. Properties of the Scattering States. The fundamental problem of scattering 
theory is to solve the equation 

^ +) = % + J ^ .» VV ( s +) (20.23) 

E s — H 0 + iha 

The solutions can then be used to determine the transition matrix which, according 
to the last section, is directly related to the cross section. 

Formally, we may solve (20.23) by multiplying it by E s - H 0 + iha, and adding 
and subtracting —Vty s on the right-hand side of the equation. Thus, we obtain 

(E,-H + iha)V ( s +) = (E s — H + iha)% + V% 


or 


V ( s +) = % + 


V'P' 

E s — H + iha 


(20.40) 


The important distinction between this equation and (20.23) is the appearance of H 
rather than H 0 in the denominator. 

If the solution (20.40) is substituted in (20.20) for the transition matrix, we get 

T k < = (%, V%) + (v k , V £ _ - J + (20.41) 

In this way, the cross section for a scattering process can in principle be calculated. 
However, for practical purposes not much is gained, because the effect of the op- 
erator (E s — H 4- iha)~ l is not known unless the eigenvectors of H have already 
been determined. Since this is the problem we want to solve, it is usually necessary 
to resort to approximation methods to solve (20.23). 


Exercise 20.2. Show that, if E r = E„ 

T rs = (^r ) , V'PJ (20.42) 

The crudest approximation is obtained if in (20.23) the term proportional to V 
on the right-hand side of the equation is neglected altogether: 

^ +) = % (20.43) 

This is simply the first term in the solution of (20.23) obtained by successive ap- 
proximation. If we define the resolvent operator, 

1 

E — H + iha 


«+(£) = 


(20.44) 
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q. (20.40) can be written as 

= % + <g + (E s ) V% 


sing the identity, 

<§ + (E) = G+(E) + G + (E)V<$ + (E ) 


e find by iteration the operator identity, 

n 

<s+ = [2 (g + v?]g + + (G + vy +l % + 

4=0 


Exercise 20.3. Prove the operator identity 


1 _ 1 
A~ B + 




(20.45) 

(20.46) 


(20.47) 


(20.48) 


id exploit it to verify (20.46). 


The expansion (20.47) applied to Eq. (20.40) produces the formal series ex- 
insion 


¥< +) = % + G + (E S )V% + G + (E S )VG + (E S )V% + • ■ • 


(20.49) 


he nth Born approximation to the scattering state 4 /( , +) consists of terminating the 
cpansion (20.49) arbitrarily after n terms. 


Exercise 20.4. Show that formally the series (20.49) is also arrived at by 
writing (20.26) as 




1 

1 - G + (E S )V 


% 


(20.50) 


id expanding (1 — G+V) 1 as a power series. 


The convergence of the Born series (20.49) is often difficult to ascertain, but it 
easy to see that it will certainly not converge, if the equation 

A 0 G + (E S )VV = V 

is an eigenvalue A 0 whose absolute value is less than 1. The operator (1 — AG+V) -1 
is a singularity at A = A 0 ; consequently, the radius of convergence of the series 
cpansion of this operator in powers of A must be less than |A 0 |. If 
L 0 j < 1, the Born series, which corresponds to A = 1, is divergent. If, as frequently 
ippens in cases of practical interest, the Born series fails to converge or converges 
>0 slowly to be useful, more powerful, but also more involved, approximation tech- 
ques are available for the determination of ^P (+) . 

If the first Born approximation (20.43) is substituted into the transition matrix, 
e obtain from (20.36) and (20.37): 


(20.51) 
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in agreement with (13.48). The first Born approximation is the result of a first-order 
perturbation treatment of scattering, in which the accurate equation (20.9) is replaced 
by the approximate equation (20.8). 

The formal solution (20.40) can be used to demonstrate the orthonormality of 
the eigenvectors ''J /(+> . This is seen by the following simple manipulations: 


^< +) ) = [% + 


1 


E t - H + iha 


V%, ^< +) 


= ^< +) + v 

= ^ +) + v 

! = (v k , v ( s +) - 

If we finally use (20.23), we get the result 


1 


^ +) J 

N 

\p(+) 


E k — H — iha 

1 

\J/ 

E k — E s — iha s / 

- — v^< +) ) 

E s — H 0 + iha J 


( ^(+), ^(+>) = = 8ks (20.52) 


This formula is valid only in the limit L — > °°, when E r becomes an eigenvalue of 
both H and H 0 . Entirely analogous arguments can be made to show that 

0K“\ Vf 7 ) = S ks (20.53) 

Corresponding to an orthonormal set of we thus obtain two sets, ^ +) and 
of orthonormal eigenvectors of the total Hamiltonian H. The question that arises is 
whether these sets are complete. It would appear that each set by itself is a complete 
set, because the vectors ^ form a complete set, and ^ +) (or ^ _) ) goes over into 
as V — > 0. However, one reservation is called for: The Hamiltonian H may have 
discrete energy eigenvalues corresponding to bound states produced by the inter- 
action V. These discrete states, which have no counterpart in the spectrum of H 0 and 
are never found among the solutions of (20.20), are orthogonal to the scattering 
states and must be added to all the 1 P^ +) (or Nt r ^ -) ) to complete the set of eigenvectors. 


4. Properties of the Scattering Matrix. The S matrix was defined in terms of 
time-dependent transition amplitudes in Section 20.1. It is related to the transition 
matrix, and therefore to the energy eigenstates of H and Hq, by Eqs. (20.12) and 
(20.20), leading to: 

= 8 b - 2mS(E k - E s )T ks = 8 b - 2m8(E k - E s )(V k , V^ +) ) (20.54) 

The S matrix connects the initial state with the final states of interest. An alternative 
approach to the S matrix is to think of scattering as described by idealized stationary 
states. If, owing to its preparation, the system is known to be in the eigenstate 
T r j +) of the full Hamiltonian H, the S matrix element S ks is the probability amplitude 
for detecting the system in state Therefore, when we express ' v I r ^ +) as a linear 
combination of the all belonging to the same energy eigenvalue, 

^ +) = 2 Vi~ 7 S b (20.55) 

k 

we expect that the expansion coefficients S ks are the elements of the S matrix. From 
the orthonormality of the scattering states, we obtain 

Sks = 0K~\ ^ +) ) 


(20.56) 
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this section, we prove that the two expressions (20.54) and (20.56) are indeed 
ual. The definition (20.55) implies that the S matrix is diagonal with respect to 
j energy. The 5-matrix elements between states of different energy are zero, as is 
en explicitly in the formula (20.54). 

In the case of simple scattering, the incident state is represented asymptotically 
the free particle “in” state = |k) which feeds the outgoing spherical wave 
sociated with the state ^ k +) . In a scattering experiment, we are asking for the 
obability amplitude that the particle is asymptotically found in the free particle 
>ut” state = | k '), which is fed by the incoming spherical wave associated 
th the state ^ k “\ The expansion (20.55) takes the explicit form 

Vi +) = X = X 'n~ ) <k'|5|k> (20.57) 

k' k' 

iere we have defined the scattering operator S by its matrix elements, 

<k' |5|k> = 0H7>, n +) ) = S * k (20.58) 

tween the “in” state jk) and the “out” state |k'}. 

To prove the equality of the representations (20.54) and (20.56), we use the 
rmula 


+ 


1 


Ei, — H — iha 


VV k 


(20.59) 


lich is the analogue of (20.40) for ) and substitute it in (20.56): 

/ \ 

1 


S ks = OP*, ^ +) ) + 
= + 


k E s — H — iha 
1 


VV k , ¥< +) 


E s — H 0 + iha 


\AP ( , +) 


) + ( i 

) \E k -H 


— iha 


W k , V ( s +) 


iere the Lippmann-Schwinger equation (20.23) has been ujsed in the last equality, 
nee M** is an eigenstate of H 0 and 1 P$ +) is an eigenstate of H, we can reduce this 
pression to 


Sks ~ + I - 


ks 


E k E« 


iha 


+ 


— E s + iha 


CP*. V^ +) ) (20.60) 


sing a standard representation of the delta function, Eq. (A. 21) in the Appendix, 
2 recover the formula (20.54) for the 5-matrix elements. 

If plane waves with periodic boundary conditions in a box of volume L 3 are 
losen for the set of eigenvectors ' V P J , the scattering matrix becomes 

S k ' k = 5 k k + 8 ( k ~ k')fk(k') (20.61) 


s we saw in Section 20.2, the differential scattering cross section must be propor- 
mal to | S k ' k - S k - k | 2 or |/ k (k')| 2 . 

From (20.55) and (20.20) it is easy to derive the simple formula 


T js = X vyf>)S, 


(20.62) 
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This also follows directly from the definition of the scattering matrix. If Ej = E s , 
on account of (20.42) it may also be written as 

T js = S T% S rs (20.63) 

r 

The scattering matrix owes its central importance to the fact that it is unitary. 
To prove the unitary property in the time-independent formulation, we must show 
that 4 

X S* nt S nj = 8 {J (20.64) 

n 

and 

X S (n S% = 8 eJ (20.65) 

n 

The first of these equations follows immediately from the definition (20.55) and the 
orthonormality of the ^ <+) and The second equation is proved by using (20.56) 
to construct 


2 S en S*n = S cm ^n +) )(^i +> , ^.r } ) 

n n 

If '[ f i b) denotes the bound states, we have as a result of completeness the closure 
relation 


2 <*> fc ) + 2 (<S>«, ® b ) = (<*>«, ® b ) 

n i 


for any two states and <& fo . Applying this to the previous equation, we get 

X S en S% = <pn ¥}">) - 2 (?n ¥<">) = = 8 ej 

n n 


because the bound states are orthogonal to the scattering states. 

In the time-dependent formulation, the 5-matrix element S ks is by its definition 
(20.7) equal to the matrix element of ?( + °°, — 00 ) between the initial unperturbed 
state ^ and the final unperturbed state MV Hence, formally, 


S = it +00, -oo) 


( 20 . 66 ) 


The time development operator is unitary, and thus the operator S is unitary, as 
expected. The 5-matrix element S ks is the probability amplitude for finding the sys- 
tem at t — + oo in state ^ k if it was known to have been in state at t — — oo . 

The connection between the two definitions of the 5 matrix is further clarified 
by the two equivalent expressions for the stationary scattering states: 

¥< +) = % + G + (E s )W c s +) (20.67) 

which is the Lippmann-Schwinger equation, and 

¥< +> = 2 PM*. + (20.68) 

k 

obtained by substituting (20.33) into (20.55). If we now imagine that wave packets 
are constructed from these expressions by superposition, we may relate them to the 
time-dependent description of the scattering process. At t = — °°, only the first term 




io 


Chapter 20 The Formal Theory of Scattering 


(20.67) contributes, and at t — + °°, only the first terms in (20.68) contribute, 
ace the retarded (G+) wave vanishes before the scattering and the advanced (G_) 
ave vanishes after the scattering. Hence, the matrix S ks connects the free initial 
ate s with the free final states k as described by 


- <*|s|j> = <fc| f(+°°, -oo)\s) 


(20.69) 


The scattering operator S is useful because it depends only on the nature of the 
stem and the partition of the Hamiltonian H into a “free” Hamiltonian H 0 and an 
teraction V, but not on the particular incident state. A simple application to scat- 
ring from a central-force field in the next section will illustrate the advantages of 
orking with the scattering operator. 

For formal manipulations in scattering theory, it is sometimes convenient to use 
e operators T, fl (+) , and fl (_) defined by the equations 

T^ = (¥*, T%) = <*|r|j> = or*, (20.70) 

id 

ft (+) ¥ r = ¥< + \ = i rr ) for all r (20.71) 

om the last definition it is evident that fl (+) and preserve the norm of vectors 
i which they act. Nevertheless, in general they are not unitary (only isometric ), 
icause the full Hamiltonian H = H 0 + V may support discrete bound eigenstates, 
hich can be expanded in terms of the unperturbed states M r r , but are orthogonal to 
e states ''TO'- 1 and _) . Therefore, fl (+)t fi (+) = fl (-)+ fl (_) = 1, but f — 
(_) fl (_)t = 1 is not necessarily valid. 3 


Exercise 20.5. Prove that 

t = rn (+) (20.72) 

id 

5 = [a ( " ) ] + 0 (+) * (20.73) 

Exercise 20.6. Show that 

Ml (±) = a (±) H 0 (20.74) 

id from this relation deduce that 

[S, H 0 ] = 0 (20.75) 

id verify again that the S matrix is diagonal with respect to the energy. 

Rotational Invariance, Time Reversal Symmetry, and the S Matrix. If a spa- 
ll rotation is applied to all the states of a physical system in which scattering 
;curs, the initial and final momentum states are rotated rigidly. According to Sec- 
m 17.2, this is accomplished by applying a unitary operator U R . If the forces are 
mtral, V = V(r), the Hamiltonians H and H 0 are both invariant under rotations, 
id the scattering matrix will be the same before and after the rotation. In this case 
e have 

(U R k'\S\U R k) = <k'|S|k> 

3 For a more detailed exposition, see Taylor (1972). 
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Hence, the scattering matrix cannot depend on the absolute orientation of the vectors 
k and k ' in space. It can only be a function of the energy and of the angle between 
the initial and final momenta. If the particles have no spin, the completeness of the 
Legendre polynomials allows us to write the scattering matrix in the form 


<k'|S|k> = S(k - k') 2 F t {k)P t { k • k') (20.76) 

e=o 

A 

with undetermined coefficients F e (k). The delta function has been included as a 
separate factor, because we already know that the S matrix has nonvanishing ele- 
ments only “on the energy shell,'’'’ i.e., between two states of the same energy. 

The coefficients F t (k) can be determined to within a phase factor by invoking 
the unitarity of the scattering matrix: 

I <k'|Sjk"><k|S|k">* d 3 k" = 8 ( k - k') 

Substituting (20.76) here and carrying out the integration in k"-space by recourse 
to the addition theorem of spherical harmonics, Eq. (11.100), we obtain 

CO A 

^(k - k') 2 | F e (k) \ 2 P f (k • k') = <5(k - k') 

^=0 1 


We now use the identity (11.105) to write: 


<5(k - k') 


8(k- k') ^ 2€ + 1 - - 


(20.77) 


From the last two equations, we immediately find that the coefficients F e (k) must be 
of the form 


F e ik) = 


21 + 1 
47rk 2 


e 2i8gk) 


(20.78) 


where the 8 { (k) are real functions of the momentum (or energy). Inserting this result 
in (20.76), we conclude that the scattering matrix is expressible as 

00 of + 1 

<k ' | jS) k> = 8(k - k') 2 , , 2 e 2iS ‘ <k) P e ( k • k') (20.79) 

e =o 4 irk 


On the other hand, according to (20.61) and (20.77), this matrix element can also 
be written in terms of the scattering amplitude as 


(k ' | S | k> = 8(k- k 


oo 

') 2 

_( = 0 


21 + 1 
47 rk 2 


Pe ( k • k') + 


27 rk 


/*( k') 


(20.80) 


Here we have chosen the dimension of the normalization cube L = 2tt, so that the 
unperturbed eigenstates are normalized as (k'|k) = S(k - k') throughout. Com- 
paring (20.79) and (20.80), we get 


1 “ 

/*(k') = t 2 (2€ + l)e 2iSe<k) sin 8 t (k)P t (. k • k') 
k «=o 


(20.81) 


We have thus rederived the main result of the partial wave analysis (13.70) directly 
from the rotational invariance of the scattering operator. 
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Exercise 20.7. Transform the matrix element (20.79) into the orbital angular 
omentum representation (see Exercise 17.40), and show that 

(crf'm'\S\ aim) = e 2iS < ik) S m>m 8 re (20.82) 

:rifying that e 2iSi(k) are the eigenvalues of the S matrix for a rotationally invariant 
teraction, in agreement with Eq. (13. 75). 4 


In order to analyze the symmetry properties of the scattering states under the 
ne reversal operation, we write the fundamental integral equations using momen- 
m eigenstates as the unperturbed states: 


^k +) = + 


y»Jr(+> 

E k — H 0 + iha 


(20.83) 


id 


^k _) = ^k + 


— V 

E k — H 0 — iha 


(20.84) 


If we now apply the antiunitary operator © defined in Section 17.9 and choose 
e phases of the momentum eigenstates such that ©'Pr = 'P-r, we obtain 

= ^_ k + 1 — ©y©-i©^+> (20.85) 

E k — H 0 — iha 

here use has been made of the invariance of H 0 under time reversal. Comparing 
is relation with (20.84), we observe that and ^-Tr are mutually time-reversed 
ates, 

©^k +) = ^ (20.86) 

the interaction V is invariant under time reversal: 

©V© -1 = V ‘ (20.87) 


this case, the S matrix satisfies the condition 

<k'|S|k> = <p£r\ ^L +) ) = (20.88) 

= (VU’, V-J) = <-k|5l-k') 


ving to the antiunitary property of ®. For the scattering amplitude, this implies by 
0.54) the relation 

/k(k') = /- k'(-k) (20.89) 

ais equation, derived from very general symmetry properties, expresses the equal- 
ly of two scattering processes obtained by reversing the path of the particle and is 
lown as the reciprocity relation. 


The Optical Theorem. From the unitary property of the scattering matrix, we 
in derive an important theorem for the scattering amplitudes. If we substitute the 

4 The same symbol (8) is used for delta functions and phase shifts in this section, but the context 
ways determines the meaning unambiguously. 
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expression (20.54) for the S matrix in the unitarity condition (20.64) and work out 
the result, we get 


2tt 2 8(E n - E r )T* r T nj = i(T rj - T* r ) 

n 


(20.90) 


By (20.37), this formula can also be written in terms of the scattering amplitudes. 
Replacing the summation by an integration, we obtain by use of the appropriate 
density-of-states factor 


2ir 


2'nh 2 


mLr (2 tt'Y 


I 


it(k")jv(k") 


m 

¥ld' 


S(k" - k ) k " 2 dk" d^l" = -i[f k .(k) - jt(k')] 


or 


f 


/k(k")/k'(k") da" = — 


4'Jr fk' (k) - /k(k') 


2 i 


(20.91) 


As a special case of this relation, we may identify k' and k and then obtain by 
comparison with (20.39): 


A \tt 

(J = dcr = |/ k (k")| 2 da" = — Im / k (k) 

J J K 


(20.92) 


This formula shows that the imaginary part of the forward scattering amplitude 
/ k (k) measures the loss of intensity that the incident beam suffers because of the 
scattering. It therefore expresses the conservation of probability, which is a conse- 
quence of the Hermitian property of the Hamiltonian. The unitarity of S is directly 
linked to the Hermitian property of H, since according to (20.66) S is the limit of 
the time development operator (in the interaction picture). 

Equations (20.90), (20.91), and especially (20.92) are generically known as 
expressions of the optical theorem, because of the analogy with light that passes 
through a medium. In optics, the imaginary part of the complex index of refraction 
is related to the total absorption cross section. Application of the optical theorem to 
scattering from a central-force potential was the subject of Exercise 13.14. 

Exercise 20.8. Derive the optical theorem (20.90) directly from (20. 11), using 
conservation of probability. 

Exercise 20.9. Show that the first Born approximation violates the optical 
theorem. Explain this failure and show how it can be remedied by including the 
second Born approximation for the forward scattering amplitude. 


Problems 

1. Obtain the “scattering states” (energy eigenstates with E > 0) for a one-dimensional 
delta-function potential, gd(x). Calculate the matrix elements ( k ' |S|fc) and verify the 
unitarity of the S matrix. Obtain the transmission coefficient, and compare with Eq. 
(6.19) and Exercise 6.13. Perform the calculations in both the coordinate and mo- 
mentum representations. 
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Use the Born approximation to calculate the differential and total cross sections for 
the elastic scattering of electrons by a hydrogen atom that is in its ground state. 
Approximate the interaction between the continuum electron and the atom by the 
static field of the atom and neglect exchange phenomena. 

The cross section for two-quantum annihilation of positrons of velocity v with an 
electron at rest has, for v « c, the “classical” value 



Use this information to derive the annihilation probability per unit time from a plane 
wave state and (assuming that annihilation occurs only if the two particles are at the 
same place) estimate the decay probability in the singlet ground state of positronium. 

Using the Born approximation, and neglecting relativistic effects, express the differ- 
ential cross section for scattering of an electron from a spherically symmetric charge 
distribution p(r) as the product of the Rutherford scattering cross section for a point 
charge and the square of a form factor F. Obtain an expression for the form factor 
and evaluate it as a function of the momentum transfer for (a) a uniform charge 
distribution of radius R, and (b) a Gaussian charge distribution with the same root- 
mean- square radius. 

If the nonlocal separable scattering potential 

<r'|V|r") = A u(r')u(r") 

is given, work out explicitly and solve the integral equation for ^ (+) . Obtain the 
scattering amplitude, and discuss the Born series for this exponential. 



CHAPTER 21 


Identical Particles 


When we endeavor to describe the quantum behavior of systems of 
several identical particles, altogether new, strange, and unclassical 
features are encountered because there is no way of keeping track of each 
particle separately when the wave functions of two indistinguishable 
particles overlap. The indistinguishability of identical particles has 
important, experimentally verifiable consequences. For instance, the 
spectra of many-electron atoms would be altogether different if the atomic 
electrons were distinguishable. The Pauli exclusion principle, which 
explains the periodic table of elements and the stability of complex atoms, 
presupposes the indistinguishability of electrons. The quantum theory for 
identical particles is most efficiently formulated in terms of operators that 
represent the occupation number, or density of occupation, of one-particle 
states and the related creation and annihilation operators. Since it affords 
a direct route to quantum field theory, and the physics of particles and 
quasiparticles which can be created and destroyed, we will work in this 
so-called second quantization mode, but the connection to wave 
mechanics in configuration space will also be displayed. By extending the 
principles of one-particle nonrelativistic quantum mechanics to the 
description of identical particles, we will be led to bosons (symmetric 
under permutations and obeying Bose-Einstein statistics) and fermions 
(antisymmetric, with Fermi-Dirac statistics) as the only types of particles 
consistent with perfect indistinguishability. 

1 . The Indistinguishability of and the State Vector Space for Identical Particles. 
The indistinguishability of two particles makes it impossible to follow them indi- 
vidually in the region where both may be found simultaneously. Similar complica- 
tions do not arise in classical mechanics from the identity of two bodies because 
their wave packets do not overlap, and the particles move in separate, distinguish- 
able, continuous orbits. Even if they are not followed as they move along their orbits, 
two objects that are identical can, classical mechanics assumes, always be made 
distinguishable by marking them in such a way that there is no measurable influence 
on the physical process under consideration. For instance, we suppose that coloring 
the balls in billiards differently has no influence on their motion; yet it serves to 
distinguish them individually and to identify them after any number of collisions. 

If the particles are distinguishable, we saw in Section 15.4 that the appropriate 
quantum mechanical description is a state vector space constructed as a direct prod- 
uct space from the one-particle theory. The same strategy can be employed for n 
indistinguishable particles, but only at the expense of labeling or numbering the 
identical particles. Since the order in which the particles are labeled has, by the 
definition of particle identity, no physical significance, state vectors (or wave func- 
tions) that differ only in the permutation of the labels must define the same state. 
Thus, there is in this mode of description a symmetry under the n\ permutations of 
the particle labels, violating the principle that (except for a phase factor) there is 
exactly one state vector for every physical state of a system. The redundancy in the 
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ascription of identical particles must then be removed by limiting the state vector 
)ace to a subspace that is invariant under the permutations of labels. 

We prefer to build up the quantum mechanics of n identical particles without 
itroducing redundant states that must be subsequently excised. Let us assume the 
cistence of a complete set of dynamical variables K, appropriate for a single par- 
cle. As in Section 9.1, we characterize the corresponding set of eigenvalues col- 
ctively by K h without attaching primes. A particular state of the system is then 
jecified by stating that of the n particles n x have the value K x , n 2 have the value 
2 , etc. More general states are obtained by superposition of such particular 
•particle states. It is impossible to tell which particle has the value K x , which one 
le value K 2 , etc. For example, the coordinates x, y, z and the spin component S z 
institute a complete set of dynamical variables for electrons. By measurement, it 
in be determined that there is one electron at position rj with spin component 
i/2) a i (where cr 1 — + 1 or — 1), another electron at position r 2 with spin component 
i/2)a 2 , etc., but it is not possible to make any further identification of the electrons. 

A basic arbitrariness is inherent in the definition of the term particle as it is 
sed here, and the identity of two particles is to a certain extent a matter of con- 
mtion. For example, protons and neutrons may be regarded as two distinct species, 
istinguishable by differences in mass, charge, magnetic moment, and decay prop- 
ties. Yet, since interactions can transform protons into neutrons and vice versa, it 
frequently convenient to describe them as two different states of the same species, 
ie nucleon. All nucleons are then considered as identical, and proton and neutron 
ates are characterized by different values of a new dynamical variable, the isobaric 
spin," or isospin. In high-energy processes, which are dominated by the strong 
iteractions, strangeness, hypercharge, and other quantum numbers associated with 
ie symmetry group SU( 3) place the neutron and proton in a particular multiplet 
>ctet) which has six other, “strange” spin one-half baryonic members. At a deeper 
vel, these assignments of quantum numbers are understood in terms of the con- 
ituent quark structure of these composite “particles.” The quarks themselves can 
i treated as species with distinguishable flavors, that is, up, down, and strange, and 
) forth, and distinctive colors (red, green, blue), or they may be thought of as one 
nd of particle that can be in states with different flavor and color “quantum num- 
irs.” 

In certain problems, on the other hand, it may be convenient to regard entire 
rclei, atoms, or molecules as particles. Their composite nature implies the existence 
' some internal degrees of freedom, which must be included among the dynamical 
iriables used to describe the states of the system. 

The latitude in the definition of what is meant by a particle is no obstacle in 
e development of the quantum mechanics for identical particles. On the contrary, 
e general principles of the theory are applicable to any species of particle, char- 
fierized by a set of dynamical variables that can be regarded as complete in the 
rntext of the problems under study. 

In developing a mathematical framework for these concepts, we define the quan- 
m state vector space for a system of n identical particles on the basic assumption 
at 

any complete set of operators K, which describes the behavior of a single 

particle, can also be employed for n particles of the same kind. 

bis minimal assumption, postulated to be true even in the presence of interactions 
;tween the particles, implies that the composite system retains the properties of the 
dividual constituent particles to a considerable extent. 
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The physical context defines the meaning of this assumption. For example, in 
the theory of atomic and molecular structure and spectra, and in solid-state physics, 
it makes sense to consider all electrons as replicas of a single electron, with its 
position and momentum coordinates and its spin (one-half), and no additional at- 
tributes. In relativistic quantum electrodynamics, however, it is necessary to include 
a further degree of freedom, corresponding to the positron which, as the antiparticle 
of the electron, represents another state that can be excited by interaction with the 
electromagnetic field. 

As another example, helium in its gaseous or condensed (liquid or solid) form 
is properly regarded as an assemblage of identical inert helium atoms in their ground 
state, provided that the temperatures are not excessively high. At stellar tempera- 
tures, this picture breaks down, and the internal degrees of freedom of the helium 
atom become active. At even higher, thermonuclear, temperatures, the degrees of 
freedom relevant to the structure of the nucleus must be taken into account. 

As long as we assume that the system composed of identical particles is merely 
a collection of the constituent particles, we can postulate that to each eigenvalue K t 
of K corresponds an occupation number operator N t whose eigenvectors characterize 
the states in which a definite number, n,-, of the particles has the value K t . The 
eigenvalues of N t are the occupation numbers n t . As a fundamental postulate, we 
assume that the totality of the operators N t forms a complete set of commuting 
Hermitian operators for the system of identical particles. (If the system is composed 
of several different species of identical particles, occupation number operators for 
each of these are introduced.) In the next two sections, we will see that this postulate 
is responsible primarily for the unique roles played by Bose-Einstein and Fermi - 
Dirac statistics. However, we must first construct the state vector space of the many- 
body system by a suitable generalization of one-particle quantum mechanics, paying 
due attention to the indistinguishability of the particles. 

The fundamental postulate implies that in the state vector space of the many- 
body system, also known as Fock space, the basis vectors 

\n u n 2 , n 3 , . . .) (21.1) 

which allot the eigenvalue K x to n x particles, the eigenvalue K 2 to n 2 particles, etc., 
constitute a complete set of orthonormal basis vectors for the system of identical 
particles. The most general state of the system is a linear combination of the kets 
( 21 . 1 ). 

In particular, we have the “no-particle” (or vacuum) state 

^ (0) = | 0 > = 1 0 , 0 , 0 , ...) ( 21 . 2 ) 

Next there are the one-particle states 

VV = |£j> = |0, 0, . . . , n,- = 1, 0, 0 . . .) (21.3) 

spanning the one-particle subspace of the much larger vector space of the many- 
body system, which is composed of states with zero, one, two . . . particles and their 
linear combinations. The principles of quantum mechanics in the one-particle sub- 
space, which have been the subject of most of this book so far, will now be gener- 
alized to systems containing an arbitrary, and variable, number of identical particles. 

A remark on notation: Equation (21.3) puts us on guard and warns us that in 
this and the next chapter the bra-ket notation will be used in both its narrow single- 
particle form, | A',), and its Fock space version, sometimes even in the same equation. 

Does the postulate adopted in this section provide the necessary ingredients for 
a quantum theory of interacting particles? To answer this question, it must be shown 
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lat a consistent theory can be constructed in this framework and that the theoretical 
alculations are in satisfactory agreement with experimental observations. The as- 
lmption that the state of a system of interacting particles can be expanded in terms 
f the states of noninteracting particles finds broad support in many successful cal- 
alations. Especially when the relatively weak electromagnetic interactions predom- 
late, as in atoms, molecules, and solids, perturbation theory affords an account of 
most unlimited precision, provided that the theory is “renormalized.” Renormal - 
ation involves rules designed to make consistent and accurate predictions possible, 
i spite of the appearance of divergent quantities in the calculations. Quantum elec- 
odynamics and its offspring, electroweak gauge theory, are the most mature ver- 
ons of the theory of interacting fields and particles. Even the theory of the fun- 
imentally nonperturbative strong interactions, quantum chromodynamics, owes its 
iccess to the existence of a regime (“asymptotic freedom”) in which the concepts 
' perturbation theory are applicable and the constituents may be regarded as nearly 
minteracting. 

Creation and Annihilation Operators. For a given species of particles, it is 
;eful to define creation operators aj with the property 

a]\n u n 2 , . . . n,-„ n h n i+1 , ...><* \n u n 2 , . . . n t - u n t + 1, n i+u . . .) (21.4) 

Iding to the basis state one particle with quantum number K h From the definition 
' a Hermitian adjoint operator, it follows conversely that an annihilation operator 
has the property 

a t \n lf n 2 , . . . Hi-i, n h n i+1 , . . .) oc \ n ^ n 2 , . . . n , f _ l5 n ( - 1, n i+1 , . . .} (21.5) 

d thus in effect removes one particle with quantum number K t . The constants of 
oportionality in these defining relations are yet to be determined. In the interest 
simplicity, we begin this determination by requiring 

4^(0) = 4|0> = = | Kt) = Jo, 0, = 1, 0 . . .) (21.6) 

aVP = a,\Kj) = a,|0, 0, . . . n } = 1, 0, . . .) = 8^™ = 8 tj |0> (21.7) 

nee the vacuum contains no particle to be destroyed, we also demand that 

a.yw = 0.| O ) = o (21.8) 

Exercise 21.1. If ^ (1) is an arbitrary one-particle state, prove that the prob- 
ility, |<0|^|^ (l) )| 2 = |(^T ( | > I , ' (l) )| 2 , of finding the particle to have quantum number 
is equal to the expectation value {^ (1) |aJa ; -|T r(l) }. 

In order to establish how a unitary transformation from one basis to another in 
; one-particle theory can be expressed in this new notation, as in Section 9.5, we 
roduce a second complete set of one-particle observables L with eigenvalues L q . 
e corresponding occupation number operators will be denoted by N q with eigen- 
lues h q . The transformation between the two representations is effected by 

|*,> = 2 \L q )(L q \ K t ) (21.9) 

e complex transformation coefficients {L q \K t ) form a unitary matrix. 

The vacuum or zero-particle state <E> <0) is the same in the new as in the old basis: 

$( 0 ) = y<m = | 0 ^ ( 21 . 10 ) 
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The one-particle states in the new basis are 

= \0,0,...,n q = 1, 0, 0,...) = \L q ) 

We now introduce creation operators b\ and annihilation operators b q in complete 
analogy with (21.4) and (21.5): 

b\\n u n 2 ,... n q - lt h ip n q+l , . . .) oc | n u n 2 , . . . n q - u n q + 1, n q+1 , . . .} (21.11) 
b q \h u n 2 , . . .*n q - l , n q , h q+1 , ...)<* |n x , n 2 , . . . n q - u n q - 1, n q+1 , . . .) (21.12) 

Evidently, for the one-particle states, the two bases are connected by the equation, 

«?| 0 ) = = I K t ) = X I L q )(L q \K t ) = X bl(L q \K,)\0) 

q q 

This equation is satisfied by the transformation condition 

(21.13) 

The Hermitian adjoint of this equation, 

(21.14) 

is trivially satisfied when acting on the vacuum state. Applying a, to a one-particle 
state, we have, by the use of Eqs. (21.7) and (21.8) and their analogues, 

= 5 ,^ (0) = X {Ki\L q ){L q \Kj)"^ m = X {K^L^LAKj)*^™ 

= X b q (K,\ L q ) X = X b q (K t \L q )^ 

q r q 

This relation is again satisfied by Eq. (21.14). Hence we may adopt the operator 
Eqs. (21.13) and (21.14) in the subspace of no particles and one particle. 

It is a basic notion of quantum mechanics that the creation of a particle with 
quantum number K t is equivalent to the creation of a particle with any one of the 
quantum numbers L q , each contributing an amplitude {L q \K,) in a linear superpo- 
sition. The operator Eq. (21.13) is consistent with this general requirement. The 
change to yet another, third one-particle basis M s is effected by the relation 

aj - X K(L q \K t ) = X (X bl(L q \M s )j(M s \K ! ) 

explicitly exhibiting the equivalence of all one-particle bases or representations for 
constructing the many-body theory, if (21.13) is valid. 

We therefore make the fundamental assumption that the transformation equa- 
tions (21.13) and (21.14) will hold beyond the no-particle and one-particle subspaces 
as operator equations in the entire state vector space of the system of any number 
of identical particles. We will see that this assumption determines the structure of 
the commutation relations that the creation and annihilation operators must satisfy 
and narrows the freedom we have in deriving the constants of proportionality which 
enter (21.4), (21.5), (21.11), and (21.12). Natural, and almost innocuous, as this 
assumption, or principle of unitary symmetry, may appear, it is ultimately respon- 
sible for the restriction to Bose-Einstein and Fermi-Dirac statistics in the quantum 
theory of many-particle systems. Hypothetical quanta or particles called anyons. 
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/hich violate the fundamental assumption made here, are useful constructs in certain 
pproximate models of (especially, two-dimensional) many-body systems, but they 
nd other unconventional versions of quantum statistics (so-called parastatistics) are 
utside the scope of this book. 


. The Algebra of Creation and Annihilation Operators. The first step in deriv- 
lg the algebraic properties of creation and annihilation operators is to note that two 
reation operators a] and a], when applied successively to a state, produce the same 
hysical state, although the normalization may depend on the order in which the two 
articles are created. Hence, for any state *P, 

ajajty = AaJaf'P (21.15) 

To show that the constant A is independent of the state TP and of the subscripts 
and j — and indeed of the particular one-particle representation chosen — it is ad- 
antageous to consider what happens in changing to another one-particle represen- 
ttion. Substituting a} from Eq. (21.13) into Eq. (21.15), we get 

(ajaj - AajajyV = 2 <4 1 K t ){L e \ K } )(blb\ - A b\b\W = 0 (21.16) 

k,{ 

he transformation coefficients are arbitrary complex numbers, except for the con- 
:raint imposed by unitarity: 

2 (Ki\L k )(L k \Kj) = S v (21.17) 

k 


: the theory is to have the same form in any representation (unitary symmetry), Eq. 
11.16) can be satisfied for all states 'P only if, for every value of k and €, 


b\b\ - A b\b\ = 0 

ence, also, 

b\b\ — A b\b\ = 0 

id consequently 


A 2 = 1 or A = ±1 


ccording to the assumption spelled out in Section 21.2, these algebraic relations 
>r creation operators must hold for any of the unitarily equivalent one-particle 
presentations, including the original basis created from the vacuum by the oper- 
ors aj. 

It follows that there are two and only two classes of algebraic relations for 
eation operators of various particle species. They must satisfy either the commu- 
tion relations 


a] a] — ajaj = 0 


• the anticommutation relations 


a] a] + ajaj = 0 


(21.18) 


(21.19) 


le possibility that for a specific kind of particle (21.18) holds for some creation 
aerators and (21.19) for others need not be contemplated if there are no super- 
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selection rules operative that would restrict superpositions and the unitary transfor- 
mations (L k \K t ) in the one-particle theory to certain subspaces. 

By taking the Hermitian adjoint of Eqs. (21.18) and (21.19), we obtain, re- 
spectively, 


and 



( 21 . 20 ) 


( 21 . 21 ) 


A similar argument can now be made for two operators a, and a], with i + j. 
Consider 

a,aJ\P' = fiajaffr 

The unitary transformations (21.13) and (21.14) give for i + j, 

(a.aj - paja^ = £ <^|L*><L,|^>(^J - = 0 (21.22) 

k,Z 

and this must be valid for any state. In satisfying this condition, we must again take 
the unitary constraint (20.17) into account, leading to the inference that for all values 
of k and €, but subject to the restriction k # €, we must have 

b k b\ - pb\b k = 0 (21.23) 

and thus also 


a t a] — pa]a t — 0 for i A j (21.24) 

If we substitute this back into Eq. (21.22), we get (for i f j) 

S (K, I L k )(K k | Lj)(b k b k ~ nbtb k n = 0 

k 

This equation is compatible with the unitarity condition (21.17) only if 

b k b\ - pbib k = A (21.25) 

where the operator A is independent of the subscript k. If we now expand the ex- 
pression — pM\a t by use of the transformation relation (21.13) and (21.14) and 
apply the conditions (21.23) and (21.17), we conclude that 

a £ -al - pa] a t — A (21.26) 

Hence, the Hermitian operator A is the same for all pairs of creation and annihilation 
operators in any one-particle basis. By letting (21.26) act on the vacuum, we see 
from Eqs. (21.6) and (21.8) that A | 0 ) = | 0 ). Hence, we may choose A to be the 
identity operator, A = 1: 

Uia] — pa]a t = 1 (21.27) 

To complete the theoretical framework, we define the Hermitian operators N t 
whose eigenvalues are the occupation numbers, n h of the particles for which K has 
the value K t . The sum of these, 




(21.28) 
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list be the additive operator measuring the total number of particles, and this must 
invariant under a change of the one-particle basis: 

N = 2 N, = 2 N k 

i k 

;her than the trivial identity operator, essentially only one additive operator is 
variant under the transformations (21.13) and (21.14). It is 

2 aja,- = S b\b k (21.29) 

i k 

rich is invariant by virtue of the unitarity of the transformation {K t \L k ). The op- 
ator 

2 ad = X b k bl 

i k 

also invariant, but because of the com m utation relation (21.27), it can be expressed 
terms of (21.29) and the identity. 


Exercise 21.2. Verify the invariance of the additive operator (21.29) under 
e , unitary transformation (21.13), (21.14) (principle of unitary symmetry). 


Thus, we expect N t to have the form 

N t = x a]a t + y 1 

here x and y are real constants. We must require that V, |0) — 0, hence, y = 0. 
irthermore, for a one-particle state, 

N&P = V?' 


om (21.7) and (21.6) it follows that x = 1. Thus, the occupation number operators 
e simply 


N t — aja i 


(21.30) 


lese operators are positive definite and have nonnegative expectation values, as 
;pected. 

If the basis vectors (21.1) are to be eigenstates of N h it is easy to see from the 
:fining relations (21.4) and (21.5) for the creation and annihilation operators that 
e following conditions must be met: 

N t a k - a k Ni = — a\N, = 0 for i + k (21.31) 

well as 


N t ai - a t Ni = -a t 


(21.32) 


id 


Nidi — a\Ni = aj 


(21.33) 


the commutation relations that have so far been proved, (21.1 8)— (21 .21), and 
1.24), are employed to bring the operator a k to the right in Eq. (21.31), we deduce: 


(1 T ix)N,a k - 0 
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for all i f k. The upper sign, or p — +1, corresponds to (21.18), and (21.20) and 
the lower sign, or p = —1, to (21.19) and (21.21). 

We have thus arrived at the conclusion that, consistent with our fundamental 
assumptions, there are two and only two forms of quantum mechanics for identical 
particles: 


Bose-Einstein Case 

a\a\ — a\a\ = 0 
a k a e a^a k 0 

a j < 5 , kt 1 


(21.34) 


Fermi-Dirac Case 

ala] + ajal = 0 

a k a^ t ci i a k 0 
a k a\ + a\a k = 8 ke 1 


(21.35) 


(In the literature, the brace {A, B} = AB + BA is often used to denote an 
anticommutator, similar to the bracket [A, B ] = AB — BA for a commutator. For 
our purposes, little is to be gained by using the { } notation, and so we prefer to 
write out anticommutators explicitly.) 

The designations Bose-Einstein (B.E.) and Fermi-Dirac (F.D.) associated with 
the two classes of commutation relations will be seen to be justified when the sta- 
tistical behavior of identical particles is examined. Characteristically, each particle 
species belongs to one or the other of these two kinds of statistics. We call the 
former species bosons since they obey Bose-Einstein statistics, and the latter fer- 
mions, since they obey Fermi-Dirac statistics. A connection between the statistics 
of a particle and the transformation of its states under spatial rotation is found in 
quantum field theory, leading to the conclusion that all particles with integral spin 
are bosons and all particles with half-integral spin are fermions. 1 

One immediate consequence of Eq. (21.35) is that, for fermions, a\a\ — 0; i.e., 
there are no fermion states in which two or more particles share the same quantum 
number. This is the familiar expression of the Pauli exclusion principle. 


Exercise 21.3. Prove that the occupation number operators N t for different i 
commute for either Bose-Einstein or Fermi-Dirac statistics. 

Exercise 21.4. From the commutation relations for a, and aj, deduce that the 
operator N t = a-a, has as eigenvalues all nonnegative integers in the case of Bose- 
Einstein statistics, but only 0 and 1 in the case of Fermi-Dirac statistics. 

The operators = a]a t determine the constants of proportionality in Eqs. (21.4) 
and (21.5) except for a phase factor. Indeed 

<«i, n 2 , . . n t . . |2V f | n 1; n 2 , = n t = (n u n 2 , . . n t . . lajail^, n 2 , . . n t . .) 
Hence, 

a t |«i, n 2 , . . . n t . . .) = e lct \fn i \ni, n 2 , . . . n t —l . . .) (21.36) 


Gross (1993), Chapter 7. 
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or Bose-Einstein statistics, we may set a = 0. For Fermi-Dirac statistics, it is 
onsistent with the anti commutation relations to choose e ,a to be equal to 1 if the 
umber of occupied one-particle states with index less than i is even, and equal to 
- 1 if this number is odd. Other phase assignments are possible, but they must always 
e consistent with the commutation and anticommutation relations. 

Exercise 21.5. Prove the feasibility of the suggested phase assignments in Eq. 
>1.36). 

To summarize the effect of the creation and annihilation operators, we omit all 
lentifying subscripts and focus on the action of the typical operators a and on 
le eigenstates \n) of the occupation number operator N = a t rz, such that 

a t a\n ) = n\n) (21.37) 


It have 

ose-Einstein Case 

a\n) = \ r n\n) 

a t |n) — 

Jn + 1 1 n) 

(21.38) 

ermi-Dirac Case 

«|0) = 0 

a f |0> = 

e~ ia \l) 

(21.39) 


a\l) - e ia \0) 

<z + |l> = 

0 



or bosons (B.E.„case) the structure of the theory is isomorphic to that of the har- 
lonic oscillator, since the energy levels of the oscillator are spaced equally and can 
e interpreted as representing n quanta (or particles) of energy ho). The oscillator 
Hamiltonian (10.70) is just 

H = ho) (N + 1/2) (21.40) 

links the second quantization for bosons to the operator algebra treatment of the 
scillator in Section 10.6. 

. Dynamical Variables. Useful dynamical variables for a system of an arbitrary 
umber of identical particles can now be constructed. The operator % that measures 
le total value of an additive one-particle quantity K, like the kinetic energy, is 

3C = 2 KiN, = 2 KtaUt (21.41) 

/ i 

here a,- and aj destroy and create particles with quantum number K t . It is desirable 
) express the operator % in terms of an arbitrary set of creation and annihilation 
perators, b t and b\, which are related to a, and a] by (21.13) and (21.14). Applying 
iis transformation to (21.41) yields 

(21.42) 


nee 

2 (L q | K^KiiKi | L r ) = {Lg | K\ L r ) 



(21.43) 
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The operator % is the most general Hermitian operator that is bilinear in annihilation 
and creation operators. 

An additive two-particle operator, like the mutual potential energy, can be used 
to define 

t = \ s mVij + 1 2 a m - w, = ] - 2 w - him) 

^ i&j ** i ^ ij 

a 

where we may assume, without loss of generality, that the numbers V i} form a 
real symmetric matrix. It is easy to see that the pair distribution operator 
Py = NjNj — SijNi, occurring in (21.44), may be written as 

Py - NjNj - 8 S N, - alaJajOi (21.45) 

for either the Bose-Einstein or Fermi-Dirac case. Hence, Eq. (21.44) may be written 
more concisely as 

r = 1 2 Vij (2i.46) 

After the transformation (21.13), (21.14) is applied, this operator acquires its gen- 
eralized form 

(21.47) 

where 

2 (L q | K,XKi | L t )(L r \Kj){Kj | L s )Vij - (qr\V\ts) (21.48) 

ij 

is the general two-particle matrix element. The order of quantum numbers in (21.48) 
implies the convention that q and t belong to one particle, and r and 5 to the other. 
From the reality and symmetry properties of the Vy we deduce that the two-particle 
operator matrix elements have the property 

(qr\V\ts) — (rg|y|^) (21.49) 

Figure 21.1 shows a diagram representation of the two-particle operator 
b\b\b s b t {qr\ V\ ts), which is a typical term in the interaction operator Y, (21.47). 

Exercise 21.6. Verify Eq. (21.49). 

Exercise 21.7. Show that the square of an additive one-particle operator is 
generally expressible as the sum of an additive two-particle operator and an additive 
one-particle operator. 

Exercise 21.8 , Prove from the commutation relations that 

<0|a;fl y 44|0) = 8 jk 8 i( ± 8 ik 8 J( (21.50) 

the sign depending on the statistics, B.E. (+) or F.D. (— ). Also calculate the vacuum 
expectation value (0|a ft a,a^44 c m |o>. 
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igure 21.1. Diagram representing the two-particle matrix element (<?r| V|ts). The vectors 
! and k 2 (in units of h) are “in” momenta, and k 3 and k 4 are “out” momenta, (See 
roblems 1 and 2 in Chapter 21.) 


. The Continuous One-Particle Spectrum and Quantum Field Operators. So 
ir in this chapter we have implicitly assumed that the eigenvectors of the complete 
it of commuting one-particle operators, from which the n-particle basis is con- 
ducted, are normalized to unity, corresponding to a discrete set of eigenvalues, 
iow we must consider a one-particle basis belonging to the continuous spectrum of 
complete set of commuting operators symbolized by x, such that the one-particle 
igenvectors are normalized as 

( x o-'|jc", a") — S(x' — d")do-', o-" 

he continuous eigenvalues x' of x and the discrete eigenvalues a' of cr label the 
asis vectors. The most common example of a continuous. basis occurs if x is the 
rdinary position coordinate of a particle. The discrete variable a may represent the 
■component of the spin of the particle, as explained in Section 16.3. 

The formalism of the preceding sections can be modified and transcribed with- 
ut difficulty to be applicable to this case, but the occupation number representation 
; no longer useful, unless by a discretization procedure the coordinate space is 
rtificially partitioned into a large number of small but finite cells. 

It is customary to write ^-(r') for the annihilation operator, which destroys a 
article with quantum number a' at position r', instead of using the notation a { 
hich is employed in the discrete case. An operator like *|v(r'), which depends on 
le position coordinates, is generally referred to as a quantum field operator or 
mply a field. 

With this new notation, we now generalize the commutation relations (21.34) 
id (21.35) and postulate: 

ose-Einstein Case 


iMr'>Mr") - iMr")i|v(r') = 0 

~ i|ij-(r w )i|ij.,(r') = 0 

*!v(r'M*t"(r") - i|4»(r")ifv(r') = 8(r' - r")S^ 


(21.51) 
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Fermi-Dirac Case 


i!v(r'>Mr") + *Mr>Mr') = 0 

= 0 (21.52) 

= S(r' - r")S^ 

The operator »}ip(r ')»}v (r') represents the density at the position r' of particles 
for which the discrete variable has the value cr', and 

N = 2 f ^(OiMO dV (21.53) 

cr' J 

is the operator whose eigenvalues are the total number of particles. 

For an additive one-particle operator, we have 

= 2 \\ tyl(r')(r'(r'\K\r"<T")tyAr") d 3 r' d 3 r" (21.54) 

cr' o'" J J 

and for the common kind of additive two-particle operator, 


(21.55) 


r = \ 2 f f [ «Ji;-(r')iJi^(r'> fy »(r"')^^(r"") 

^ cr’ cr* a ) " cr"" J J J 

X (r'cr', r"a" \ V\ r""a"", r'V"> dV d 3 r" d 3 r m d 3 r"" 

An arbitrary n-particle state ty (n) can be conveniently represented by the function 


<KriO-i, r 2 cr 2 , . . . , r n a n ) = {Olil^rO^fo) . . . ^(rj | V (n) ) 


(21.56) 


Evidently, with respect to permutations of the subscripts 1, 2, ... n, if/ is symmetric 
in the Bose-Einstein case and antisymmetric in the Fermi-Dirac case. In the trivial 
case of a one-particle state, n — 1, Eq. (21.56) reduces to 

if/(rcr) — (0|iJ/ cr (r)|M , ' (1) ) = (rcrj'5 , ' a) ) (21.57) 


which is simply the ordinary one-particle Schrodinger wave function with spin, rep- 
resenting the state 

The representation (21.56) of an n-particle state is convenient for many pur- 
poses. The function if/( ritr u r 2 cr 2 , . . . , r n a n ), which is symmetric (B.E. case) or 
antisymmetric (F.D. case) under exchange of any two-particle labels, is the wave 
function of the n-particle system in the 3n-dimensional Cartesian configuration space 
of the coordinates r l5 r 2 , . . . , r„. Such a representation of a state of n identical 
particles implies a numerical labeling of the identical particles, but configuration 
space wave functions, which, under particle label exchange, are neither fully sym- 
metric nor antisymmetric, are ruled out as not representing a physical state of n 
particles. The Fock representation, with its emphasis on occupation numbers of in- 
trinsically indistinguishable one-particle properties, has no room for such physically 
inadmissible states and, therefore, provides a more economical description. 2 

The usefulness of the n-particle wave function (21.56) in configuration space 
is attributable to the circumstance that, if the operators r<r constitute a complete set 


2 For a more complete account, see Sch weber (1961), Chapter 6. 
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observables for a single particle, the function if/ as given by formula (21.56) 
termines the state of n identical particles. The formula that inverts (20.56) is 


|T<n)\ = 

S [ 77t ^l„( r «) • • ■ r 2 o- 2 , . . . , r n cr n ) d 3 r x . . . d 3 r n 

<r l o‘2 .- J v ft! 

(21.58) 


r its proof, the following lemma is required: 

F n = <0|i|/ o . 1 (r 1 )i|i (7 . 2 (r 2 ) . . . • • • »|»»j(ri)|0> 

- X sgn(P„)5(rj - rJ)S(r 2 - r 2 ) . . . S(r„ - r' n )8 , 8, 


(21.59) 


1 "< r 2 (r 2 


lere the sum is to be taken over all permutations P n of the coordinates rjoy, 
t 2 , . . . (but not the primed variables) and sgn(P„) is the sign of the per- 
ltation. For the Bose-Einstein case, sgn(P„) = +1; for the Fermi-Dirac case, 
n(P„) = + 1 if the permutation P n is even and sgn(P„) = — 1 if it is odd. 

Formula (21.59) is proved by induction. It is obviously valid for n = 1. Use of 
i commutation relations (20.51) or (20.52) gives 


F„ = {Ol^o-jOa) . . . . . . ifr^(rl)jO)S(r„ - r^) 8^ 

± <0|^ o - 1 (r 1 ) . . . «|» <r ._ 1 (r fI _i>i»^(ri)»I» (r|t (r fl >!»i ; _ 1 (r;_i) . . . ^(rolo) 

' continuing to move the operator iff^r,,) to the right, step by step, and by applying 
. (21.59) for n — 1, we eventually obtain 

F n = 2 s gn(P„-i)5(r 1 - r{) . . . S(r„_! - - r') 

Pn - 1 

± X sgn(P„_ 1 )5(r 1 - r{) . . . S(r„_ 2 - r;_ 2 )S(r„_! - r')S(r„ - r'_ t ) 

Pn- 1 

+ 2 sgn(P„_ i)5(ri - r[) . . . S(r„_ 2 - r,;_ 1 )S(r n _ v - r' n )8(r n - r ' n _ 2 ) 


dch can be contracted into the right-hand side of Eq. (21.59). (The Kronecker 
Itas carrying the spin variable have been suppressed for brevity in the last equa- 
n.) 

Armed with the generalized orthogonality relation (21.59), we can now derive 
. (21.58) from (21.56) if we assume that a general n-particle state can be repre- 
lted by 

(«)^ — 

-7=7 S [ • • • (Pr^ij, r„) . . . ^(^^(rUlO^rjO-i, r 2 <r 2 , . . . , r„o-„) 

(21.60) 

this expression is substituted for ^ in Eq. (21.56) and the integrations are carried 
t, using lemma (21.59), 


(Kuo-i, r 2 o- 2 . 


r nVn) 


1 


2 


sgn(P n )/(r 1 cr 1 , r 2 <x 2 , 


r n o- n ) 
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results. On the other hand, owing to the commutation relations for the creation 
operators, (21.60) can be written as 




2 f d 3 r x . . . d 3 r n tyl n ( r„) . . . *|»^(r 2 >|»J. 1 (ri) 1 0) 

02 ... J 


* 



sgnCPJ/CrjO-j, r 2 cr 2 , . . . , r n cr n ) 


Hence, Eq. (21.58) follows. 

By taking the inner product of 0I r( ” ) | with (21.58) and using the complex con- 
jugate of Eq. (21.56), we get 


<^ (n) |^ (M) > = 2 f l«Kr job, . - - , r„(T„)| 2 d\ . . . d?r n (21.61) 

cr 1 o'2... J 

This equation establishes the normalization of the wave function if/, which can thus 
be made the basis of a straightforward probability interpretation in configuration 
space: The particles are labeled numerically, as if they were distinguishable, 1 
through n, and j^Oucrx, r 2 tr 2 , . . . , r„cr n )| 2 is understood as the probability density 
of finding particle 1 at position r j with spin cr 1 , particle 2 at position r 2 with spin 
<r 2 , and so forth. This description has been the traditional approach to the quantum 
mechanics of n particles. For the special case of n — 2, we already encountered it 
in Sections 15.4 and 18.8 (helium atom). 

Expectation values (and off-diagonal matrix elements) of operators can be ex- 
pressed either in the language of creation and annihilation operators or in terms of 
wave functions in multidimensional space. Thus, if ^T(r, p) is a one-particle operator, 
which is a function of the coordinate r and the conjugate momentum p, the expec- 
tation value of the sum operator K in a normalized n-particle state is 


0P (n) , m (n) ) = <q^|3{|qrM) 

- 2 j *Ur')K(r', y V'^Mr') d 3 r' ^ 

= 2 2 f 0*(ri<Ji, .. - V4 iKuoy, . . r n a n ) d 3 r x . 

j— 1 <r\ o-2 -. J \ 1 / 


. . d 3 r„ 


(21.62) 


Similarly, if the two-particle interaction operator V is local in coordinate space and 
spin-independent, such that 

<rV, r"cr" | V | r""cr"", r"V"> = V(r\ r")8(r' - r"")S(r" - (21-63) 

we may write for an n-particle state, 

yq/OO) — ^'y(«)| < y|'vjr(")) 

„ (21.64) 

= 22 I 00*1 oi, - . - , r n af)\ 2 V{r h tJ) d 3 r l . . . d 3 r n 

i>j=lo- 1 tT 2 ... 


Exercise 21.9. Prove Eqs. (21.62) and (21.64). 


6. Quantum Dynamics of Identical Particles. Up to this point in this chapter, 
nothing has been said about changes in time. Following the program of Chapter 14, 
we now assume that the time development is governed by a Hermitian operator 
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e Hamiltonian of the system. For simplicity, we also assume that 'M is not explicitly 
;pendent on the time t. 

We must decide which picture to use, such as Schrodinger, Heisenberg, or in- 
raction. In this section we will first use the Heisenberg picture, because it is 
structive to see the similarity of the equation of motion for the field operator 
(r, t) with the wave equation in the one-particle theory. Eventually, we return to 
e Schrodinger picture to derive the equation of motion for the wave function in 
•particle configuration space. 

All physical observables can be constructed from creation and annihilation op- 
ators. It is, therefore, of interest to determine the temporal behavior of the fun- 
imental building blocks and to derive the equation of motion for some arbitrary 
inihilation operator Since, in what follows, all operators will be evaluated at 
me t, we will, for brevity, omit reference to the time dependence in some of the 
icillary equations. The commutation relations are valid at all times, provided that 
e two factors are taken at the same time. 

In order to obtain an equation for the time development of aj, it is helpful to 
ake explicit assumptions about the structure of the Hamiltonian operator. We cover 
any important applications by making the physical assumption that °<li preserves 
e total number of particles and that it is composed only of additive one- and 
/o-particle operators. The most general form of is thus assumed to be 

W = 2 at(k\H 0 \i)a e + ]r 2 a\a\{qr\ V\ts)a s a t (21.65) 

kt 2. q rst 

here H 0 is a one-particle operator, such as the kinetic energy plus the potential 
lergy in an external field, and V is a two-particle interaction operator. 

To derive the equation of motion for a 3 , we must evaluate the commutators 

[aj, ata ( ] and [a,-, alaja s a t ] 

ais is easily done by using the commutation relations for creation and annihilation 
)erators. The results, after some straightforward algebra, are 

[a p a\a t ] = a ( 8 kj ‘ (21.66) 

id 

[aj, a\ala s a t ] = a f r a s a t 8 qJ + ala t a s 8 rj (21.67) 

is remarkable that these relations hold for both Bose-Einstein and Fermi-Dirac 
atistics. 

Exercise 21.10. Prove Eqs. (21.66) and (21.67) for both kinds of statistics. 
Using these identities, we obtain in the Heisenberg picture: 

ih —7 = [aj, X] = 2 U\Ho\*)a< + \ 2 a\a s a,{jr\V\ts) + \ 2 a\a t a s {qj\V\ts) 
at £ £ rst Z q S i 

= 2 (j\H 0 \Z)a e + ^2 ala s a t ((jr\V\ts) + <tf|V|jf» 

i 2 rs t 

nally, the symmetry relation (21.49) gives, for both statistics, the desired equation 
motion: 



( 21 . 68 ) 
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A case of special interest arises if we work in a representation in which the 
two-particle interaction matrix element is diagonal: 

(jq\V\sr) = V jq 8 js 8 qr {V jq = VJ (21.69) 

If (21.69) holds, the equation of motion for the annihilation operator is 

(21.70) 

To examine the form that this equation takes if the one-particle basis is the 
coordinate representation, supplemented by the discrete variable cr, we assume that 
the interaction energy is local and spin-independent, as in (21.63). Equation (21.70) 
can then be transcribed by using cr and r to denote the one-particle quantum numbers 
as labels on the operators ili^r) and *Jr£.(r), replacing a, and a] labeled by their 
subscripts. 

This transcription results in 

ih — ! = 2 f <rajH 0 |r'o-'>iJv(r', t) dfir' 

*' /r\ i o -i \ 

+ S 

cr' 

where V(r, r') = V(r' , r). If, as is often the case, the one-particle operator H 0 is a 
function only of coordinates and conjugate momenta so that 

(rojtf 0 |r'cr'> = tf 0 ^r, y V^S(r - r')8 a<T , (21.72) 

Eq. (21.71) can be rewritten as an integral-differential equation in ordinary space: 


J V(r, r')i|it'(r', t) fa. (r', t) d 3 r' ^(r, 0 



t) 


= H 0 \ r, T V U]i CT (r, t) 


+ 2 f V(r, r')il»t'(r\ t) d 3 r' t ) 

cr' J 


(21.73) 


Again, the universality of this equation, applicable to both bosons and fermions, 
deserves to be emphasized. 

If there is no interaction between the particles, (21.73) reduces to 


. diho-(r, t) ( h \ 

ih — Tt — = H °\ r, l v 


(21.74) 


which, in form, is identical with the one-particle wave equation — the time-dependent 
Schrodinger equation. However, it must be remembered that here ^(r, t) is a field 
operator and not an ordinary wave function. Equation (21.74) can be regarded as 
the quantized version of the wave equation (3.1) when the latter is interpreted as a 
classical field equation. The annihilation operator corresponding to the coordinate 
r was denoted by if/(r), rather than some other symbol, in the foreknowledge of its 
ultimate identification with the quantized Schrodinger field. This is the point of 
contact between quantum field theory and the many-body formulation developed in 
this chapter, traditionally misnamed second quantization. 

The interaction term in Eq. (21.73) causes the equation of motion in the Hei- 
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enberg picture to be nonlinear. Since it involves an integral over all space, it can 
e interpreted somewhat casually as describing the effect of all the particles on a 
ingle one. The effective potential 

2 [ V(r, r'WUr', t)x K'(r’, t ) d 3 r' = 2 f V(r, r')p„,(r\ t) d 3 r' 

a~' J (j' J 

an be calculated only if the solution of the equation of motion is already known, 
bius suggesting an iteration procedure to generate a self-consistent solution. Such 
;chniques for solving the many-body problem are, indeed, frequently applied (see 
Chapter 22). 

From the equation of motion (21.73) for the field operators, we can now derive 
le wave equation in configuration space for an n-particle system. The time-depen- 
ent wave function is defined by introducing the time into (21.56) and writing 


iKrioy, r 2 cr 2 , . . . , r„cr n ; t ) 


\fn\ 


<0|ifi CT] (ri, t)*Mr 2 , 0 • • • • (r„, 


(21.75) 


Ve apply ih — to both sides of this equation and replace the time derivatives of the 
dt 

eld operators by the expression on the right-hand side of (21.73). The noninter- 
cting part of the Hamiltonian is easily seen to lead to a sum of n separate terms in 
le wave equation, one for each particle. The interaction term is reduced by use of 
ae commutation or anticommutation relations for the field operators, moving 
f£.(r) to the left in successive steps and recognizing that (0|iff^.(r) = 0. When the 
ermutation properties of the wave function are taken into account, the wave equa- 
on in configuration space is obtained as 


ih ^ i/<r i(Tu r 2 a 2 , . . . r„a n ; t) = 

at 

;i4h) 


+ ^ E v ( r P r t ) 

z y',€= 1 

U*e) 

* 

^(rjo-j, r 2 cr 2 , . . . r n a n ; t ) 


(21.76) 


'his equation has the form expected for any n-particle configuration-space wave 
quation. The permutation symmetry of ^(i^o-j, r 2 <x 2 , . . . , r n cr n ) is conserved as a 
unction of time. 

Exercise 21.11. Complete the steps in the derivation of (21.76) from (21.75). 


’ roblems 

. (a) Show that if V(r) is a two-particle interaction that depends only on the distance 
r between the particles, the matrix element of the interaction in the k-representation 
may be reduced to 

<k 3 k 4 | Vl^k,) = S( k t + k 2 - k 3 - k 4 ) ^ j V(r) e-* r d 3 r 
where hq is the momentum transfer /i(k 3 — kj). 
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(b) For this interaction, show that the mutual potential energy operator is 
T = J fff d 3 k x d% d 3 q V(ki + q)V(k 2 - q)<J>(k 2 )<Kk,)F(q) 

where F(q) is the Fourier transform of the displacement-invariant interaction. 

2. Show that the diagonal part of the interaction operator Y, found in Problem 1 in the 
k-representation, arises from momentum transfers q = 0 and q — k 2 — k t , respec- 
tively. Write down the two interaction terms and identify them as direct (q = 0) and 
exchange (q - k 2 — kj interactions. Draw the corresponding diagrams (Figure 21.1). 

3. In the k-representation, calculate the matrix element of the interaction in Problem 1 
for the screened Coulomb potential V 0 e~ ar /ar and plot it as a function of q. For. 
bosons and fermions, construct the corresponding two-particle interaction operator T 
for identical particles in terms of the creation and annihilation operators in k-space. 

4 . Defining the momentum space annihilation operator 3 

<t>(P) = | <p|r)»|i(r) d 3 r 

derive the commutation (or anticommutation) relations for <J>(p) and <j> f (p). For the 
Bose-Einstein case, show that the mixed commutator of field operators in coordinate 
and momentum space is 

[<j>(p), V(r)] = <p|r> 

5. In the second-quantization formalism, define the additive position and total momen- 
tum operators 

r - J »|/ t (r)r»|i(r) d 3 r and p = J 4> + (p)p<t»(p) d 3 p 

and prove that for bosons their commutator is 

[r, p] = ihN 1 

where N is the operator representing the total number of particles. Derive the Hei- 
senberg uncertainty relation for position and momentum of a system of bosons, and 
interpret the result. 

6. Local particle and current density operators at position r are defined in the second- 
quantization formalism as 

p(r) = J i|# + (r')S(r' - r)iji(r') d 3 r‘ 

and 

> /• 

j(r) = — J d 3 r' ty\r')[VS(r' - r) + S(r' - r)V']i|>(r') 

(a) Show that the expectation values of these operators for one-particle states 
are the usual expressions. 

(b) Derive the formulas for the operators p(r) and j( r) in the momentum rep- 
resentation. 

7. Two identical bosons or fermions in a state 

^ (2) = A 2 Cidja] a}Y (0) = A X dja] 2) c i a l 1 0) 

U j 1 


3 For simplicity in Problems 4-6 we suppress any spin reference to spin variables. 
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are said to be uncorrelated (except for the effect of statistics). If X k,-| 2 = 
2 \di\ 2 = 1, determine the normalization constant A in terms of the sum 
S = 2) c*di- 

(a) In this state, work out the expectation value of an additive one-particle op- 
erator in terms of the one-particle amplitudes c, and d, and the matrix elements 
(i\K\j). 

(b) Show that if S = 0, the expectation value is the same as if the two particles 
with amplitudes c, and d t were distinguishable. 

(c) Work out the expectation value of a diagonal interaction operator in terms 
of c h dj, and the matrix elements (ij\K\k£) = V ij 8 ik 8 J{ . Show that the result is the 
same as for distinguishable particles if the states of the two particles do not overlap, 
i.e., if c t di = 0 for all i. 

. A state of n identical particles (bosons or fermions) is denoted by |'\P' (n) ). 

For n — 1, the probability of finding the particle in the one-particle basis state 
i is the expectation value {'^ f(1) | A, |^ r(1) ). (See Exercise 21.1.) 

(a) For n = 2, prove that the probability of finding both particles in the one- 
particle basis state i is the expectation value of IV, (IV,- — l)/2. 

(b) For n = 3, obtain the function of IV,- whose expectation value is the proba- 
bility of finding all three particles in the same basis state i. 

(c) For n = 2, show that the expectation value of NjNj is the probability of 
finding the two particles in two different basis states, i A j. Prove that the probability 
of finding one particle in basis state i and the other particle not in basis state i is the 
expectation value of IV,- (2 — IV,-). 



CHAPTER 22 

Applications to Many-Body Systems 


In this chapter, “second quantization” as a unifying concept of many- 
particle physics will be illustrated by several applications. We return to 
the coupling of angular momenta and present the Hartree-Fock self- 
consistent field method, leaving detailed discussion of many-body 
problems in atomic, condensed-matter, and nuclear physics to the 
specialized literature. 1 The thermal distribution functions for the ideal 
Bose-Einstein and Fermi-Dirac gas will be derived directly from the 
commutation and anticommutation relations for the creation and 
annihilation operators for the two species. 


1. Angular Momentum in a System of Identical Particles. An important example 
of an observable in a system of identical particles is the angular momentum operator, 
which according to (21.41) is the additive one-particle operator 

$ = 2 X S a Jm’ a a jma (jm'\j\jm) (22.1) 

a j mm' 

The one-particle basis is characterized by the angular momentum quantum numbers 
j and m, as defined in Section 17.4, and a stands for all remaining quantum numbers 
needed to specify the basis. The total angular momentum operator (22.1) owes its 
simple structure to the absence of off-diagonal matrix elements of J with respect to 
j and a. 

The operator $ 2 = $ • $ is not just the sum of the J 2 for the individual particles 
but contains terms that couple two particles. Thus, it serves as an example of an 
additive two-particle operator. Since it conserves the number of particles, annihi- 
lating one and creating one, the operator $ commutes with the total number-of- 
particles operator, N. 

Exercise 22.1. Exhibit the two-particle matrix elements of the square of the 
total angular momentum explicitly. (See also Exercise 21.7.) 


The two-particle states in which $ z and J? 2 have the sharp values Mh and 
J(J + 1 )h 2 are readily constructed by the use of the Clebsch-Gordan coefficients 
defined in (17.52): 

V?m = C 2 a} 2m2a2 a] ]miai {jJ l m 1 rn 2 \jJ 2 jM)y (0) (22.2) 

m l m 2 

where ^ t0) = 1 0) is the vacuum state. Since the one-particle state aj ma |0) is nor- 
malized to unity, (22.2) completely parallels expression (17.52). The normalization 
constant C = 1, unless ol 1 = a 2 and j x = j 2 . 


'For further study see Koltun and Eisenberg (1988) and, at a more advanced level, Fetter and 
Walecka (1971). Thouless (1961) emphasizes models of many-body systems that are exactly soluble. 
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The expression (22.2) remains an eigenvector of $ z and f 2 even if a 1 = a 2 and 
= j 2 = j, but the normalization is altered. The symmetry relation (17.61) permits 
» to rewrite (22.2) in the form 

y?M = \ [1 ± (-1 ) J ~ 2J ]C X a } m2Ct2 a] m lUl 0>, m 2 \jjJM)A> (0) (22.3) 

ith the upper sign applicable to bosons and the lower sign to fermions. Hence, the 
igular momenta of two identical bosons (fermions), which share all one-particle 
lantum numbers except m, cannot couple to a state for which J — 2 j is an odd 
ven) number. If the usual connection between spin and statistics is assumed and 
jsons (fermions) have integral (half-integral) spin, odd J values of the total angular 
omentum cannot occur for two alike bosons or fermions with the same a and j. 

The value of the normalization constant C in (22.3) may be determined by 
quiring m) = 1- The unitarity condition (17.59) readily yields the value 

= 1/V2, so that if = a 2 and — j 2 — j. 


^ ( JM = 2 a jm 2 a a Jrn x a{jjm i W 2 \jjJM)V (0) 


(22.4) 


Exercise 22.2. Verify the normalization (22.4). 

Exercise 22.3. Construct explicitly in terms of states of the form a] m2Q .a] m](x 1 0 ) 
e total angular s momentum eigenstates for two neutrons in the configurations 
>i /2 ) 2 and (p 3 / 2 ) 2 . How would the angular momentum eigenstates look if the two 
irticles were a neutron and a proton but otherwise had the same quantum numbers 
before? 

Exercise 22.4. Show that if two identical particles with the same quantum 
imbers a and with angular momentum j couple to zero total angular momentum, 
e resulting pair state is, in an obviously simplified notation, 

¥© = [2(2/ + I]" 172 X (-lTaW-JO) (22.5) 

m=-j 

Angular Momentum and Spin One-Half Boson Operators. If we postulate a 
fiitious boson with spin one-half, and with no other dynamical properties, the total 
igular momentum operator (22.1) for a system of identical particles of this kind 
kes the form 

$ = \ (4/2 al l/2 )(r( a ' /2 'j (22.6) 

2 \a-mj 

ere the creation operators for the two spin states, m — +1/2 and —1/2 are denoted 
mply by a\ /2 and aL m . Equation (22.6) may be decomposed into 

h 

$+ = ft a \/ 2 a -i /2 ~ ftt^-in a i /2 = yf^\i 2 a \n ~ a -m a -in) (22.7) 
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in agreement with our expectations for J> ± as the raising (lowering) operator that 
changes the state \JM) into \ JM±l). Using the boson commutation relations, we 
derive from (22.7) the relation 


$ 


2 _ 


h 2 h 2 

— (a\ / 2 ai/2 + aLi / 2 a- y 2 ) 2 + ~ {.<An a \n + a ^-\i2 a -iii) 


( 22 . 8 ) 


h 2 9 h 2 , N (N 

= rrN 2 + — iv=ft 2 -- + 1 
s 4 2 2 \ 2 


Hence, the state with a total number n = 27 of identical spin one-half bosons is an 
eigenstate of f 2 with eigenvalue 7(7 + 1 )h 2 , where 7 is either integral or half- 
integral. The simultaneous eigenstates of the occupation number operators 
N+ = a\, 2 ai ,2 (number of “spin up” bosons) and N _ = a f - m a-m (number of “spin 
down” bosons) are also simultaneous eigenstates of $ 2 and $ z — h{N+ — N-)/2. 
The eigenvalues, n ± , of the occupation number operators are determined by the 
relations 


n = n+ + n_ = 27 n+ — n_ = 2 M (22.9) 

or 

n+ = J + M n- = J - M (22.10) 


Hence, by (21.36), the eigenstates, normalized to unity, are 


|7M> = 


(aI/2) J+A V-i/2r M |ftN 

V(7 + M)!(7 - M)\ 


( 22 . 11 ) 


The vacuum state, corresponding to zero boson occupation, represents zero angular 
momentum, or 1 0, 0) = |0). 

In terms of the vector model of angular momentum, the representation (22.11) 
of the state \JM) may be recognized as the projection Mh of the resultant of 27 spin 
one-half vectors combined to produce the “stretched” vector polygon with all spin 
one-half vectors “parallel.” The requirements of Bose-Einstein statistics for the 
spins that make up this resultant cause this state to be uniquely defined. At the level 
of the representations of the group SU( 2), discussed in Chapter 17, the connection 
between spin one-half bosons and the generators of the rotation group can be un- 
derstood as an extension of the Clebsch-Gordan formula (17.71) for a direct product 
of n two-dimensional representations of SU( 2): 

(2 - (-l)"\ Al 

2®2®2(x)... = (n + l)©(n- l) A -> 0 ... © ( - j (22.12) 


where the superscript A k (k — 1 , . . n — 1 ) denotes the multiplicity for each irreduc- 
ible representation contained in the direct product. The n + 1 -dimensional repre- 
sentation n + 1 is uniquely contained in (22.12). It corresponds to the linear trans- 
formations among the n + 1 totally symmetric stretched-configuration basis states 
for the system of n spin one-half bosons. 


Exercise 22.5. Work out the decomposition of the direct product (22.12) for 
n = 1 to 6. 
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The violation of the connection between spin and statistics implied by the use 
>f spin one-half bosons in this section does not vitiate the mathematical procedure 
hat we have outlined. The “spin up” and “spin down” bosons defined here are 
lot particles in the usual sense, since they have no momentum or energy. Rather, 
hey are abstract carriers of spin, allowing an elegant description of angular mo- 
nentum states. As auxiliary entities, these bosons may be used for a relatively easy 
valuation of the Clebsch-Gordan coefficients and of the more complicated struc- 
ures that arise in the coupling of more than two angular momenta. 2 

>. First-Order Perturbation Theory in Many-Body Systems. A simple and im- 
(ortant illustration of the use of two-particle operators is afforded by a first-order 
>erturbation calculation of the energy eigenvalues of a Hamiltonian which describes 
i system of interacting identical particles: 


S'f = 2 SiO-ai + ~ 2 a\ala s a t {qr\V\ts) 

i ** 


(22.13) 


qrst 


t is assumed that the eigenstates of the unperturbed Hamiltonian of noninteracting 
(articles 


= 2 


(22.14) 


ire known and characterized as |n x , n 2 , . . . n t . . .) by the eigenvalues n t of the 
iccupation number operators a]a t . If the eigenvalues of "Kq are nondegenerate, first- 
irder perturbation theory gives for the energies the approximate values 




= 2 + ]- 2 {n x n 2 . . ,| alala.a^n^ . . .)(< 2 t| V|ta> 

i £ 


(22.15) 


qrst 


n evaluating the matrix element of the operator ala}a s a t , it is helpful to recognize 
hat, owing to the orthogonality of the unperturbed eigenstates, nonvanishing con- 
ributions to the interaction energy are obtained only if q A r and either s — r and 
= q or s = q and t = r, or if q = r = s — t. Equation (22.15) is therefore reducible 
o 


1 


E ni n 2 ... = 2 n i E i + ~ 2 n q n r [{qr\V\qr) ± (qr\V\rq)] 


q=tr 


+ \ 2 n q(n q - 1 ){qq\V\qq) 

^ a 


(22.16) 


The + sign holds for Bose-Einstein statistics and the — sign for Fermi-Dirac statis- 
ts. The two matrix elements (qr\V\qr) and (qr\V\rq), connecting the two one- 
>article states q and r, are said to have direct and exchange character, respectively. 
The last term in (22.16), which accounts for the interaction of particles occupying 
he same one-particle state, vanishes for fermions, since in that case n q — 0 or 1 
Pauli exclusion principle). 

The evaluation of a matrix element of the product of several creation and an- 
dhilation operators carried out here is typical of most calculations in many-body 
heories. The labor involved in such computations is significantly reduced if the 
jperators in a product are arranged in normal ordering, i.e., with all annihilation 
>perators standing to the right of all creation operators. The operators in the Ham- 


2 For a full treatment, see J. Schwinger, On Angular Momentum in Biedenharn and Van Dam 
1965), p. 229. 
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iltonian (22.13) are already normally ordered. If a product is not yet normally or- 
dered, it may, by repeated application of the commutation relations, be transformed 
into a sum of normally ordered products. A set of simple manipulative rules may be 
formulated 3 which permit the expansion of an operator of arbitrary complexity into 
terms with normal order. 

As an example, we choose the fundamental problem of atomic spectroscopy , 
the determination of energy eigenvalues and eigenstates of an atom with n electron. 4 
If all spin-dependent interactions are neglected, only electrostatic potentials are ef- 
fective. In this approximation, both the total orbital and the total spin angular mo- 
mentum commute with the Hamiltonian. As was suggested in Section 18.6, it is 
practical to require the eigenvectors of %£ 0 , on which the perturbation theory is based, 
to be also eigenvectors of the total orbital and the total spin angular momentum. A 
level with quantum numbers L and S is split by the spin-orbit interaction into a 
multiplet of eigenstates with definite J values ranging from \L — S| to L + S. This 
scheme of building approximate energy eigenstates for an atom is known as L-S (or 
Russell-Saunders ) coupling. 

If is a central-force Hamiltonian for noninteracting particles, the unperturbed 
eigenstates are characterized by the set of occupation numbers for the one-particle 
states, or orbitals, with radial and orbital quantum numbers n h €,. Each pair of 
quantum numbers n t , €, defines an zth atomic (sub)shell. A set of occupation numbers 
for the atomic orbitals is said to define a configuration. A particular configuration 
usually contains many distinct states of the product form 

Eigenstates of 3€ 0 that are represented by a product of n creation operators are called 
independent particle states. 

Although, generally, knowledge of the atomic configuration and the quantum 
numbers L, S, M L and M s is not sufficient to specify the state of an atom unambig- 
uously, in simple cases, such as near closed shells, these specifications may deter- 
mine the state uniquely. The states of the two-electron atom (e.g., neutral helium) 
may be fully classified in this way, and we will discuss these in some detail. 

If the two electrons are in different shells, the states of any two-electron con- 
figuration (ni€i)(« 2 ^ 2 ) which are simultaneously eigenstates of the total orbital and 
the total spin angular momentum are, according to Eq. (22.2), 

\^ tinii 2 {LSM L M s )) 

— 2 LM l ) 2 (h 1 m l m 2 | 

mim2 m\m'2 

(22.17) 

If the two electrons are in the same shell and the configuration is (n/) 2 , it is legitimate 
to set n x = n 2 = n and €j = € 2 = € in (22.17), provided that a normalization factor 
of 1/V2 is furnished. 

Exercise 22.6. Use the symmetry relations for Clebsch-Gordan coefficients to 
show that a configuration (n £) 2 can only give rise to spin-orbit coupled two-electron 
states for which L + S is even, i.e., states l S, 3 P, l D, and so on. 

3 Koltun and Eisenberg (1988), Chapter 8. 

4 A useful introduction to atomic, molecular, and solid state applications of quantum mechanics 
is Tinkham (1964). 
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The ground state of the neutral helium atom is described by the configuration 
ds) 2 and has the spectroscopic character ^o- In our notation, this state may be 
expressed as 

I ^io\o(0000)) ~ a ioo,-i/ 2 a ioo,i/ 2 1 0) (22.18) 

rhe configuration of the simplest excited states is (l.s)(n€) with n > 1. Since the 
wo spins may couple to 0 or 1, the excited states (L = €) are classified as singlet 
S — 0) and triplet (5=1) states. With the appropriate values for the Clebsch- 
Sordan coefficients substituted in (22.17), we obtain for the triplet states: 

l^ionX^l m, 1 )) = &n€m, 1 / 2 ^ 100 , 1/2 | 0 > 

I 0)) ~ ~\/2 ^ a " <m,_1/2a ^ 00 ’ 1/2 a "f"i.l/ 2 a 100 ,- 1 / 2 ) 1 0) (22.19) 

— 1)) = a nem,- l/2 fl 100,- 1/2 I 

ind for the singlet states: 

I ^ (al em - 1 / 2 ^ 100 , 1/2 ~ a \emM2 a \aQ,- Ml) I 0 ) (22.20) 

Dwing to the anticommutation properties of the creation operators, the triplet states 
ire symmetric under exchange of the spin quantum numbers of the two particles and 
mtisymmetric under exchange of the set of spatial (radial and orbital) quantum 
lumbers. The situation is reversed for the singlet states. 

The perturbation interaction, arising from the Coulomb repulsion of the elec- 
rons, is diagonal with respect to all the unperturbed states that we have constructed, 
ind the first-order corrections to the energy are the expectation values of the inter- 
iction in these states. These energies were already worked out in terms of direct and 
ixchange integrals in Section 18.8. We now see that the identity of the electrons, 
nanifested in their statistics, results in a definite correlation between the spatial, 
irbital, symmetry and the total spin S of the system. The states of parahelium are 
linglet states, and the states of orthohelium are triplet statqs. In complex atoms, the 
:onnection between S and the spatial symmetry of the state is less simple and not 
lecessarily unique, but S remains instrumental in classifying the orbital symmetry 
>f the states and thus serves as a quantum number on which the energy levels depend, 
wen though spin-dependent interactions are neglected and the interaction depends 
mly on the position coordinates of the electrons. 5 

I. The Hartree-Fock Method. One of the most useful methods for approximating 
he ground state of a system of n interacting fermions is based on the variational 
iroperty of the Hamiltonian 

bl(a\H 0 \a')b a . + \ 2 b f a bp{ a/3\V\a' f3')bp>b a ’ (22.21) 

ctct' ^ af3a r 1 3 ' 

rhe essence of the Hartree-Fock method is to seek a new one-particle basis with 
:reation operators a\ such that the independent-particle state 

|¥„> = aWn- 1 • • • ] 0> (22.22) 


5 For a compact treatment of the theory of atomic spectra in terms of the second quantization 
ormalism, see Judd (1967). 
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renders the expectation value of K stationary. In this new basis, the Hamiltonian 
appears as 

%€ = 2 al(i\H 0 \t)a e + ^ 2) ala\(qr\V\ts)a s a t (22.23) 

The exact eigenstates of the Hamiltonian are usually not as simple as |' V P„) but can 
be thought of as linear combinations of independent-particle states, with the ex- 
pression (22.22) as the leading term. Although the variational method per se does 
not single out the ground state from all energy eigenstates, the ground state is of 
paramount interest, and the knowledge that it minimizes the expectation value of ffl 
greatly aids its determination. We will use | ']/'„) as written in the form (22.22) to 
denote the ground state. Excited states of the n-particle system will then be expressed 
in terms of |''P 1/ ). For example, the state aja k (with k and j labeling occupied 
and unoccupied ond-particle states, respectively) is an independent-particle state 
similar in structure to (22.22) but orthogonal to |\P„) and may be regarded as an 
approximation to an excited state of the system. 

The variation to be considered is a basis change, which is a unitary transfor- 
mation and expressible as 

a\ + Sat = 2) a j( 8 jk + isjic) (22.24) 

j 

or 

8a k i X ^0 

j 

with transformation coefficients e jk such that 

s jh\ « 1 

The general variation of the state | 'P,,) can be built up as a linear combination 
of independent variations of the form 

1 8%^ = Sj k aja k \y v ) or (8% k \ = s* k (^ v \ataj (22.25) 

where, acting on a ket on the right, a k must annihilate a fermion in one of the 
occupied one-particle states 1, . . . , n, and a] must create a particle in one of the 
previously unoccupied one-particle states n + 1 , . . . . The unitarity of the transfor- 
mation coefficients in (22.24) requires that the e jk form a Hermitian matrix. Since 
the variation 1 8^ kj ), with subscripts reversed from (22.25), vanishes owing to the 
exclusion principle, the condition s, kj — s* Jk can be ignored, and the independence 
of the s- variations is assured. (Variations with j — k do not change the state and are 
therefore irrelevant.) 

We may thus confine our attention to variations | SW) of the form (22.25) which 
are orthogonal to the ‘ ‘best” state ] T f „) of the form (22.22). The variational theorem, 

8{W) - 0 

in conjunction with the Hermitian property of 'K, requires that 

The orthogonality of | and |''P I ,) guarantees that the variation preserves the nor- 
malization of the state, and according to the last equation, makes it necessary that 
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8W) also be orthogonal to , &C\''P V ). Hence, the variational condition is ( Brillouin’s 
heorem) 


- 0 


(22.26) 


f the Hamiltonian (22.23) and the variation (22.25) are substituted into this con- 
lition, we obtain 

2 ^ 2 (qr\V\ts)(y v \aia j ala1.a s a'\ r <lr v ) = 0 

it ^ qrst 

lince k labels an occupied one-particle state in and j labels an unoccupied 

>ne, the last relation is seen to be equivalent to the equation 

(j\H 0 \k) + i [(jt\V\kt) - (jt\V\tk)] = 0 (22.27) 

t= i 

"he sum over t is to be taken only over the occupied one-particle states. 

Condition (22.27) suggests the introduction of an effective one-particle Ham- 
Itonian, H hf , defined by its action on the (as yet undetermined) one-particle energy 
igenstates | m): 

oo n 

H hf \ m) = H 0 \m) + 2 |p) 2 i(pt\V\mt) - (pt\V\tm)] = s m \ m) (22.28) 

p= i t— i 

Vith this definition, condition (22.27) can be construed as expressing the orthogon- 
lity between the occupied and unoccupied eigenkets of H hf : 

< j\H HF \k > = 0 (22.29) 

If in the original one-particle basis b f a the interaction between the fermions is 
liagonal and represented as 

<ajS|V|a'/3'> - V^8 aa ,8^ (22.30) 

iq. (22.28) takes the form 

n 

r d HF \m) = H 0 \m) + S \a){t\P)V ali [{p\t)(a\m) 

'=i (22.31) 

- { a\t)(l3\m )] = e m \ m) 

?he summation over the Greek indices extends over the complete set of one-particle 
tates. Equations (22.28) and (22.31) are known as the Hartree-Fock equations. 
7 rom (22.28) we immediately infer that 

n 

(m\H 0 \ m) + ^ [( mt\V\mt ) — (mt\V\tm)] = s m (22.32) 

r=l 


Exercise 22.7. Verify that the one-particle Hartree-Fock Hamiltonian H hf is 
lermitian. 

The occupied states |r) in Eqs. (22.27) and (22.31) are not at our discretion. 
Since 1^^) is to be a trial vector approximating the ground states, they must be 
fiosen from among the eigenkets of (22.31) in a manner that will minimize the 
:xpectation value of the Hamiltonian. Frequently, the best choice corresponds to the 
ise of those eigenkets that belong to the n lowest eigenvalues e k , although, perhaps 
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contrary to expectations, the variationally minimal value of is not just the sum 

n 

of the Hartree-Fock one-particle energies, 2 s k • Rather, the Hartree-Fock approx- 

k=\ 

imation E v to the ground state energy is 

n i n 

E v = m = (VMV v ) = 2 (k\H 0 \k) + - 2 \Wv\ tk) - <€*|V|*€>] 

k= l 4 e,k=i 

= ^ 2 [e* + Wol*>] = 2 ** - \ i K^|y|a> - <a|v|M)] 

£ k= 1 A=1 ^ 

(22.33) 

For the “excited” state a]a k \"^ v ), we obtain 

<30 = (VvlatajXajat |^„) = E v + e, - e k - (jk\ V\jk) + (jk\V\kj) (22.34) 

If the last two terms can be neglected, Sj — e k represents an excitation energy of the 
system. 

Exercise 22.8. Verify expression (22.34). 

Exercise 22.9. Prove that the expectation value of Hi in the “ionized” state 
a k | with n — 1 particles is 

( c Ji) — E v — s k ( Koopmans ’ theorem) (22.35) 


The practical task of solving the Hartree-Fock equations is far from straight- 
forward. The equations have the appearance of a common eigenvalue problem, but 
the matrix elements of the interaction V, which enter the construction of the effective 
one-particle Hamiltonian H hf , cannot be computed without foreknowledge of the 
appropriate n eigensolutions \t) of the coupled equations (22.31). These equations 
are nonlinear and require an iteration technique for their solution. One starts out by 
guessing a set of occupied one-particle states |f); using these, one calculates the 
matrix elements of V, and one then solves the Hartree-Fock equations (22.31). If the 
initial guess, based on insight and experience, was fortuitously good, n of the ei- 
gensolutions of (22.31) will be similar to the initially chosen kets. If, as is more 
likely, the eigensolutions of the Hartree-Fock equations fail to reproduce the start- 
ing kets, the eigensolutions corresponding to the lowest n eigenvalues s k are 
used to recalculate the matrix elements of V. This procedure is repeated until a self- 
consistent set of solutions is obtained. Sufficiently good initial guesses of the one- 
particle trial states are usually available, so that in actual practice fairly rapid con- 
vergence of the iteration process is the rule rather than the exception. 

In the representation that diagonalizes V, the Hartree-Fock equations can be 
rewritten in matrix form as 


2 [(cx\H 0 \p)(p\m) + 2 {t\fi)V a p{fi\t){a\m) - <f|/3>V a/s <a|f><jS|/n>] 

* 1=1 (22.36) 

= e k (a\m) 

As an application of these equations, we consider an atom with a nuclear charge 
Ze and with n electrons. Then 


r 
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he electron-electron interaction V is diagonal in the coordinate representation and 
is the form 


V(r<r, r'o-') 


r - r 


fe choose the coordinate representation with spin as the basis [a) and [ /3), and we 
inote the Hartree-Fock eigenfunctions as 

<ro-|m) = lAm(ro-) 

/ith this notation, the Hartree-Fock equations (22.36) are transcribed as 


2m 


V 2 4t m (ra) 


- — iAm(rcr) + e 2 ^ f iAT(r'cr ') - — — - a')i}r m (rcr) d 3 r' 

r t =\ u’ J I r — r j 

f — - tp t (va)\p m {r' a') d 3 r' = e m ip m (rcr ) 

t=\ a’ J r - r 


(22.37) 


hese coupled nonlinear differential-integral equations constitute the most familiar 
ialization of the Hartree-Fock theory. The first sum on the left-hand side (without 
le term t = m if m is an occupied state) represents the average effect of the inter- 
ction between all the other electrons in occupied one-particle states. The last sum 
n the left-hand side is attributable to the exchange matrix elements of the interac- 
on. 


Exercise 22.10. Show that the configuration space wave function correspond- 
ig to the independent particle state (22.22) can be expressed as the Slater deter- 


xinant 

<Mri0-i) 

i/b (r 2 o- 2 ) 

■ ■ ■ <Mr 


1 

iMriO-i) 

i// 2 (r 2 cr 2 ) 

* 'l'2(rn<Tn) 

(22.38) 

^(rjo-!, . . . , r„cr„) - / — 

V«! 

^(rio-j) 

^n(r 2 o- 2 ) 

• • • <An(rn^n) 


. Quantum Statistics and Thermodynamics. The many-body operator formal- 
sin of Chapter 21 is ideally suited for treating statistical ensembles of identical 
articles. Here we will derive the quantum distribution functions for a system of 
oninteracting particles in thermal equilibrium. 

If p denotes the density (or statistical ) operator for an ensemble with fixed 
alues for the averages of and N, statistical thermodynamics requires that the von 
leumann entropy, 

S — —k trace(p In p) (22.39) 

>e made a maximum subject to the constraints 

(N) = trace(plV) = n, (%£) = trace(pTf) = E, trace(p) = 1 (22.40) 

"he entropy principle is based on the probability and information concepts intro- 
luced in Section 2 in the Appendix and Section 15.5. Except for the multiplication 
iy Boltzmann’s constant k, the entropy S is that defined in Eq. (15.128). 
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Using the Lagrangian multipliers a and /3, we see that the variational principle 
takes the form 

8(S - ka(N ) - kp(%t)) = 0 (22.41) 

The normalization constraint in (22.40) requires that the variations of the density 
operator be restricted to 

, trace(Sp) = 0 (22.42) 

and, therefore, 

8S — —k trac e(5p In p + 8p) — — k trac e(Sp In p) 

Substituting all the variations into (22.41), we obtain 

trace [§p (In p + aN 4- fHW)] = 0 
which is consistent with (22.42) only if 

In p + aN + — —In Z 1 

where Z is a number. We thus arrive at the grand canonical form of the density 
operator: 

(22.43) 

The normalization condition gives us 

Z = trace | (22.44) 

which is called the grand partition function. The parameters a and ft must be de- 
termined from the first two constraint conditions (22.40). By thermodynamic argu- 
ments, /3 = 1/kT is a measure of the temperature and p, = — a//3 is identified as the 
chemical potential. 

Exercise 22.11. Evaluate the entropy for the equilibrium state (22.43), and 
show that 

-AT In Z = <3E> - p(N) — TS — E — TS — pn (22.45) 

which is the grand canonical potential (or generalized free energy), suitable for 
relating thermodynamic variables to the underlying microscopic description. 6 



For a system of noninteracting identical particles with one-particle energies e h 
known in thermodynamics as a generalized ideal gas, 

M - X =2 SiNt (22.46) 

i i 

The ensemble average of any physical quantity represented by an operator Q 
may be computed by application of the formula 

( Q ) = trace pQ (22.47) 


6 Callen (1985), Section 5.3, and Reif (1965), Section 6.6. 
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Nt apply this relation to the evaluation of the average occupation numbers Np 

( Ni } = ( aja t } — trac t(e~ aN ~ p ^d\a l )IZ (22.48) 

Jsing Eqs. (21.31)— (21 .33) and the identity (3.59), we find that 

trace(e _ “ /V_/3 ^a|a J ) = e ~ (a+ ^ trac e(e~ aN - p3e a ia J) (22.49) 

Exercise 22.12. Verify Eq. (22.49). 

If the commutation relations for bosons or anticomutation relations for fermions 
ire used, we obtain (with the upper sign for bosons and the lower sign for fermions) 

trace(c~ aN ~ ^ x aJ a t ) = e~ Ca+l3ei) trace[e - “ yV- ' 3 ^(l ± a/a,)] 

Combining this relation with (22.43), we obtain 

(22.50) 

which is the familiar formula for the distribution of particles with Bose-Einstein 
sign) and Fermi-Dirac (+ sign) statistics, respectively. 

The connection with the more conventional method for deriving the distribution 
'22. 50) is established by introducing the occupation numbers n t as the eigenvalues > 
i>f Ni = a/a,- and the corresponding eigenstates | n u n 2 , . . . n,-, . . .) as basis states of 
:he ideal gas. In this representation, the grand partition function becomes 

Z = 2 FI e- (a+Ps ‘ )n ‘ (22.51) 

The distribution (22.50) is recovered by computing 

(22.52) 

1 6 dSi 

which follows from (22.44) and (22.48). The two kinds of quantum statistics are 
distinguished and their partition functions are different, because in the Bose-Einstein 
case the occupation numbers assume all nonnegative integers as eigenvalues, 
whereas for the Fermi-Dirac case, n, — 0, 1 are the only possible values. 

The derivation of (Ni), using operators rather than the occupation-number basis, 
is intended to exhibit as plainly as possible the connection between the commutation 
relations for bosons and the anticommutation relations for fermions and the — and 
+ signs, respectively, which characterize the denominator of the two distribution 
laws. 

The Maxwell-Boltzmann distribution, 

(N t ) - e- a ~ e ‘ ,kT (22.53) 

is an approximation for the quantum distributions (22.50), valid if (N,-) « 1. This 
may be regarded as a low-density or high-temperature approximation. 

Exercise 22.13. Using operator algebra, show that the square of the fractional 
deviation from the mean occupation number is 

m - (N,)) 2 ) = (Nj) - (IV,) 2 = a+/3E . = J_ 

(Ni) 2 (N^ 2 6 (N^ ~ 



(22.54) 
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where the + sign pertains to Bose-Einstein statistics and the — sign to Fermi-Dirac 
statistics. Also consider the Maxwell-Boltzmann limit. 

In the next chapter, the formalism developed here will be applied to the deri- 
vation of the Planck distribution for photons in thermal equilibrium. 


Problems 

1. Consider a system of identical bosons with only two one-particle basis states, 
al/ 2 '^ f(0) and fl- 1 / 2 ' v l / ' (0) - Define the Hermitian operators x, p x , y, p y by the relations 

if p x \ 1 ( .Py 

““ = vsr r + ‘ 7> = ‘7 

where c is an arbitrary real constant, and derive the commutation relations for these 
Hermitian operators. Express the angular momentum operator (22.6) in terms of these 
“coordinates” and “momenta,” and also evaluate J? 2 . Relate J 2 to the square of the 
Hamiltonian of an isotropic two-dimensional harmonic oscillator by making the iden- 
tification c = Vmffl, and show the connection between the eigenvalues of these op- 
erators. 

2. (a) Using the fermion creation operators a] m , appropriate to particles with angular 
momentum j, form the closed-shell state in which all one-particle states m = —j to 
+j are occupied. 

(b) Prove that the closed shell has zero total angular momentum. 

(c) If a fermion with magnetic quantum number m is missing from a closed shell 
of particles with angular momentum j, show that, for coupling angular momenta, the 
hole state may be treated like a one-particle state with magnetic quantum number —m 
and an effective creation operator (— l) J ~ m a Jm . 

3. Consider the unperturbed states a\ mn • • ■ a kmk • • • a\ m] | 0) of n spin one-half particles, 
each occupying one of n equivalent, degenerate orthogonal orbitals labeled by the 
quantum number k, and with m k -± 1/2 denoting the spin quantum number associated 
with the orbital k. Show that in the space of the 2" unperturbed states a spin-inde- 
pendent two-body interaction may, in first-order perturbation theory, be replaced by 
the effective exchange (or Heisenberg ) Hamiltonian 

^eff = ~j 2 X (H | V\ £k)S k ■ S ( + const. 

7i ke 

where S*. is the localized spin operator 

Si x a inflkm‘ k (m k I <r I m’ k ) 

Z m k m’ k 

4. For a Fermi gas of free particles with Fermi momentum p F , calculate the ground state 
expectation value of the pair density operator 

X tyl-' (r , )*J*ff"(r ,, ) l IV’( r,, )*!v (r ' ) 

or' , or” 

in coordinate space and show that there is a repulsive interaction that would be absent 
if the particles were not identical. Show that there is no spatial correlation between 
particles of opposite spin. 
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!. Calculate in first order the energies of the l S, 3 P, and l D states arising from the atomic 
configuration p 2 (two electrons with € = 1 in the same shell). Use the multipole 
expansion 


e 


2 


r' 



y 477 

*=0 2k + 1 


k 

y k (r',d') 2 i-1)* YUnn 9 <T) 

q=-k 


for the interaction energy between the electrons, and show that the term energies may 
be expressed as 


E( l S) = E 0 + <y 0 > + — <y 2 > 
E( 2 P) = E 0 + (y 0 ) - {y2) 
E( l D ) = E 0 + (y 0 ) + ^ (y 2 > 


where (y k ) is the radial integral 

<y*> = e 2 jj y k (r', r")[R{r')f[R{r")?r' 2 r" 2 dr' dr" 

i. Apply the Hartree-Fock method to a system of two “electrons” which are attracted 
to the coordinate origin by an isotropic harmonic oscillator potential mto 2 r 2 / 2 and 
which interact with each other through a potential V = C(r' — r") 2 . Solve the Hartree- 
Fock equations for the ground state and compare with the exact result and with 
first-order perturbation theory. 



CHAPTER 23 


Photons and the Electromagnetic Field 


The formalism developed in Chapter 21 for the quantum description of 
identical particles is sufficiently general to be applicable also to photons, 
the particles associated with the electromagnetic field, even though 
photons are relativistic particles. Instead of deriving the quantum theory 
of the electromagnetic field by applying quantization postulates to the 
classical theory, we build the elements of the quantum theory of radiation 
on the foundations of ordinary quantum mechanics for particles and show 
that in the Correspondence limit the classical Maxwell theory is recovered. 
Our goal is to acquire enough background for a first-order account of the 
interaction between photons and charged particles (mainly electrons in 
atoms) and the principles of quantum optics. With these tools it is then 
possible to deepen our understanding of optical coherence, correlation, 
and interference phenomena. 

1. Fundamental Notions. The theory is based on empirical facts about photons: 
they are bosons with zero mass; in free space they can carry linear momentum hk 
and energy ho) k = chic, and simultaneously with sharp linear momentum, they can 
have a definite value +h or — h for the component J • k of angular momentum along 
the direction of propagation. 

A 

Photons with a sharp value for J • k are said to possess definite positive or 
negative helicity. Since the component of orbital angular momentum parallel to k is 
necessarily zero, spin 1 may be attributed to the photon, provided that the eigenvalue 
zero of the projection J • k is excluded. The consistent omission from the theory of 
the eigenstate for which J • k has the value zero is possible only because the photon 
has vanishing mass and cannot be brought to rest in any Lorentz frame of reference 
for the purpose of measuring its intrinsic spin separately from the orbital angular 
momentum that it carries. 

By definition, the operators a+(k) and dL(k) correspond to the creation of a 
photon with momentum hk and with helicity J • k = +h and —h, respectively. The 
momentum states, or photon modes, are subjected to periodic boundary conditions 
defining a discrete set of k vectors with components 27 m x /L, 2n rn y /L, 2i rnJL, where 
n x , n y , n z are integers (0, ±1, ±2, . . .) as in (4.53). The transition to the continuum 
may be made by letting the arbitrary length L — » 00 , or formally by simply setting 
L = 2tt and replacing all sums over k by integrals and Kronecker deltas by delta 
functions. All spatial integrations extend over the cube of volume L 3 . 

The commutation relations for the photon creation and annihilation operators 
are 

[a + ( k), a f + (k')] = [a_(k), at(k')] = (23.1) 

All other commutators vanish. 

The important physical operators for a system of free photons are the total 
energy, 

W, = ^ chk[cd + {k)a + (k) + nl(k)a_(k)] 

k 


(23.2) 
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tie total linear momentum, 

P = 2 ftk[flt(k)a + (k) + al(k)a_(k)] (23.3) 

k 

nd the total number of photons, 

N = S [«Uk)a + (k) + al(k)fl_(k)] (23.4) 

k 

The operator $ • k, representing the total angular momentum component along 
tie direction k, is an additive one-particle operator and thus a bilinear combination 
'f a+(k'), at(k'), a+(k"), a_( k"). Owing to the helicity property of photons, the 
iperators creating and annihilating photons of momentum hk occur in this linear 
ombination only in the diagonal terms a+(k)a + (k) and at(k)<a_(k). These terms, 
i'hich represent the contribution to $ ■ k of photons with linear momentum k, may 
ie written as 

\$ ■ k, a+(k)]a+(k) + [$ • k, al(k)]a_(k) - h[al(k)a + (k) - al(k)a_(k)] (23.5) 

"his operator generates rotation of the states with definite k about the direction k. 
According to Chapter 17 and Section 21.2, we expect that such a rotation changes 
:t(k) into a\{ k)e~ ia and a!(k) into a-(k)e ia , if a is the angle of rotation. 

Exercise 23.1. Using the commutation relations, prove that 

U R at Ur — — e^ ia ?+(k) 


It is apparent that under the simple rotation considered, a+(k) and ak(k) trans- 
orm like the T\ and 7Y 1 components of an irreducible tensor of rank one defined 
n Section 17.7. Using Eqs. (17.85), we introduce the new operators 


aftk) = ^ [~fl+(k) + cl(k)] 
4(k) = ^ [fl+(k) + al( k)] 


(23.6) 


Jnder a rotation about k by an angle a, these operators are transformed into 

a\(k) cos a + a\{ k) sin a and — aj(k) sin a + a\( k) cos a 

ust as two orthogonal components of a vector in a plane perpendicular to k. If we 
ntroduce two perpendicular unit vectors e^ 0 and e^ 2) in the plane perpendicular to 
;, such that 

gU) x e? = k, e? Xk = eL°, k X e? = e^ 2) (23.7) 

t follows that 

fli(k)e^ + al(k)e? = atOkX-g^ + ie?) + ~ al.(k)(e? + ie?) 

ransforms like a three-vector that is perpendicular to k. 

The corresponding Hermitian conjugate operator is 

— a + (kM> + ie?) + a_(k)(e^> - ie?) (23.8) 


fliCk)^ 0 + a 2 (k)e? = 
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The right-hand side can be expressed more simply is we introduce the complex 
unit vectors, 



it 

Equation (23.8) then takes the form 


(23.9) 


fljO t)eL 1} + « 2 (k)eL 2) = a+(k)ek +) + a_(k)eL~ ) (23.10) 

This relation constitutes a unitary transformation from a helicity representation to a 
linear polarization representation. 

Exercise 23.2. From the definition (23.9), calculate the Hermitian scalar and 
vector products of the unit vectors ek +) , ek _) , and k. 


Exercise 23.3. Show that 

[a,(k), a\m = [(h( k), aim = 1 (23.11) 

It is tempting now to consider a unitary transformation from the momentum 
representation, which we have used to introduce photons, to a coordinate represen- 
tation by the application of the one-particle transformation coefficients 

(r|k> — L~ 3 ' 2 e ik T (23.12) 

In this way, it is possible, as in Section 21.5, to define vector operators that describe 
the creation (and annihilation) of a photon at a position r, such as 

'f'V) = 7^2 [«I(k)e^ + al( k)ek 2) ]<r* r (23.13) 

l k 

In conformity with Eq. (21.56), one could then also introduce a photon wave func- 
tion in configuration space. However, such a representation — straightforward for the 
case of nonrelativistic particles— -is of limited usefulness for photons. This is most 
easily seen by examining an observable like the energy of the system which, after 
transformation to coordinate space, will appear in the form 

3C = ^ 2 JJ 4»V) ‘ *K r ) k e ik<r - T,) d?r d 3 r’ 

Since k = Vk* + k 2 + k 2 , the integrand cannot be transformed into a local expres- 
sion involving a finite number of field derivatives and interpretable as an energy 
density. 

Nor is it possible to construct a simple local current density operator from the 
fields ijf(r) and i|/ t (r), although / »jf t (r)»|>(r) d 3 r is a constant of the motion. Last, 
but not least, the field ij/(r) defined by (23.13) does not have simple transformation 
properties under Lorentz transformations. We will see in Chapter 24 that a trans- 
formation analogous to (23.13) can be carried out without any of these stumbling 
blocks for particles of half-integral spin. It succeeds for electrons and neutrinos 
(fermions) but not for bosons! 



172 


Chapter 23 Photons and the Electromagnetic Field 


For photons, these deficiencies are remedied by introducing a slightly modified 
et of field operators : 

A (+) (r) = \/4rrh? X “= [«,(k)e< k 1} + a 2 ( W?]e ik r (23.14) 

L k V2 a) k 


The significant diffe renc e between this vector operator and *Ji(r) is the appear- 
mce of the coefficient V2o) k . We will see that its insertion permits us to write the 
;nergy density and other important observables as simple local bilinear functions of 
he field operators. The constants in the definition (23.14) were chosen with the 
oreknowledge of the ultimate identification of this operator with the vector potential 
>f the electromagnetic field. Equation (23.14) has a Hermitian adjoint companion: 


A (-) (r) - [A (+) (r)] t 

= V 4 ^h? -4 X 77 = [«I(k)e^° + at(k)K 2 ) ]e- >k r 
L k V2 co k 


(23.15) 


U1 field operators are periodic functions of the coordinates x, y, z with period L. 

If we work in the Heisenberg picture, the operators a(k) have a simple time 
lependence, which can be inferred from the equation of motion for the free photon 
ystem: 


ih 


da{ k, t) 
dt 


wk, t\ m 


Yhen the energy operator defined in (23.2) is substituted in this equation, we 
)btain 


da( k, 0 

1 — j — 

dt 


a) k a( k, 0 


vhich has the solution 

a( k, t ) = a(k)e~ ic0kt afk, t) = a\k)e iw ^ (23.16) 

t follows that A (+) (r, t ) is obtained from (23.14) if we replace e lkr by e'< kr -"* r >. 
Similarly, to obtain A (_) (r, t ), we replace e~ ik T in (23.15) by e _ ‘ (k r -"* 0 . These forms 
:xplain why A (+) is called the positive frequency part and A (_) the negative fre- 
quency part of the Hermitian operator A(r, t ), which is defined as the sum of the 
wo: 


A(r, t ) = V4rrh? X 77== iUhiW^ + a 2 (k)&e i( - k r ~^ 

L k \2(o k (23.17) 

+ [aft WP + a\ (k)ek 2) ] e ~ ,(k ‘ r_ "* r) } 

Dwing to the orthogonality of the polarization vectors e k to k, this operator satisfies 
he transversality or Coulomb gauge condition 


V • A(r, t) - 0 

n Cartesian coordinates, it also obeys the equation 


(23.18) 
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because (o k — ck. The Hermitian operators for the electric field E and the magnetic 
field B, defined by 

E = — - — and B = V X A (23.20) 

C dt 


respectively, are local field operators that satisfy the Maxwell equations for the free 
radiation field in the Coulomb gauge, excluding the electrostatic contribution — V$ 
to the electric field which arises from the Coulomb interaction between charges. 

The electric and magnetic fields are usually also partitioned into positive and 
negative frequency parts: 

E = E (+) + E ( "> and B = B (+) + B () (23.21) 


From (23.20) and (23.17) we deduce, explicitly, 

’ ! i 

E (+) (r, t) = iV2^h [fliOOSSP + a 2 (k)e^]e i<kr ~^ 

E k 

E (_) (r, t) = -iVM -^5 2) [4(k)e^ + 4(k)e^ ) ]e _,(kr_<u * f) 

E k 

B (+) (r, t) = iVlirhc 2 ^ ~T= [«i( k ) k X 4 1} + « 2 (k)k X eL 2 ) ]e , ' (k r ““*' ) 

L k v co k 


(23.22) 


B ( } (r, t) = —i^/lirhc 1 ^ r — [4( k ) k x eL 0 + 4( k ) k x e^ 2) ]e ,(k r "* f) 

L k Vce> fe 


a> k 

We also need the Fourier transforms of these vector functions, such as 

i 1 1 


(23.23) 


tfi( k )er + tf 2 (k)er = 


\Z2irh 


(Ok 


1 e -i(k.r-, t ,) E ( +)(r! t) d 3 r (23>24) 


and 


flj(k)k x + a 2 (k)k X ej^ = -—y== V(o k JJH f e ' (kr £ ° A,) B (+) (r, t) d 3 r 

(23.25) 

The identification of A, B, and E as the vector potential in the Coulomb gauge, the 
magnetic field, and the electric field, respectively, will be complete only if we can 
verify that the expressions for physical observables, such as the energy and mo- 
mentum of the field, have the correct classical form in the correspondence limit. We 
outline the proofs in the next section. 


2. Energy, Momentum, and Angular Momentum of the Radiation Field. We 
may now transform the total energy from its energy-momentum form, (23.2), to the 
standard spacetime expression by the use of Eqs. (23.8) and (23.24): 

'K = 2 ^w*[a+(k)<2 + (k) +al(k)a_(k)] 

k 

= 2 fiMkiaf^afk) + 4(k)a 2 (k)] 

k 

= E fttodalik)^ + 4(k)e£ 2) ] • [a 1 (k)ei lj + a 2 (k)ei 2) ] 

k 

= ^3 2 JJ e ik(r - r ')E ( ->(r, 0 • E (+) (r', t) d 3 r d 3 r' 
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id finally, 

(23.26) 

his expression for the energy operator is local in the sense that an energy density 
in be assigned to every point in space, and the integrand sum of all these contri- 
ations makes up the total field energy. 

Exercise 23.4. By expressing the field operators as sums of positive and neg- 
:ive frequency parts, show that 

(23.27) 

'here E 0 is the constant (infinite) zero-point energy 

(23.28) 

loting that the left-hand side of (23.27) is formally the correct relativistic definition 
f the energy of the classical electromagnetic field, show that the photon annihilation 
nd creation operators are, in the classical limit, proportional to the Fourier ampli- 
ades of the classical radiation field. Determine the constant of proportionality. 1 

The additive constant zero-point energy E 0 in (23.27) may seem irrelevant, since 
t merely shifts the origin of the energy scale. However, the frequency spectrum co k 
f the photon modes, and thus the zero-point energy, depends on the boundary con- 
ations imposed on the fields. If, instead of being periodic in a very large cube, with 
tie ultimate limit L -» °°, the fields are confined to a cavity or the space between 
wo conducting plates, the frequency spectrum is altered an'd depends on the dimen- 
ions of the confining boundaries. Under these circumstances, the full expression 
23.27) for the energy is needed to calculate the Casimir (or van der Waals ) forces 
•n the boundary surfaces, which are present even in the vacuum state. If L is the 
inear dimension involved, e.g., the distance between the conducting plates, the force 
nust in magnitude be of the order fic/L 2 . Unlike the “uncertainty pressure” caused 
>y a particle in a box, the Casimir effect represents an attractive interaction. 2 

Exercise 23.5. Show that the total linear momentum (23.3) may be trans- 
ormed into 

(23.29) 

ind establish the correspondence with the classical expression for the momentum of 
he field. 

'Jackson (1975), Section 12.10, and Sakurai (1967), Chapter 2. For more detail on the quantum 
heory of radiation, see Loudon (1983) and Milonni (1995). 

2 Itzykson and Zuber (1980), Section 3-2-4. 
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Equations (23.27) and (23.29) may be cast into a compact relativistic formula 
for the energy-momentum four-vector 


pu- — 



(23.30) 


if the contravariant spacetime coordinate four-vector 

4 x* = (x°, x 1 , x 2 , x 3 ) = ( ct , x, y, z ) = ( ct , r) 

and the metric 


ds 2 = g IJiV dx' x dx v = (dx 0 ) 2 — (dx 1 ) 2 — (dx 2 ) 2 — (dx 3 ) 2 = c 2 dt 2 — dx 2 — dy 2 — dz 2 
are introduced. We thus choose the metric tensor 

g u = g 22 = g 33 = ~g 00 = “I and g^ = 0 if p + v (23.31) 
We note that the covariant four-vector is 

x^ = g^ v x v = (ct, -x, -y, -z) (23.32) 


and that the four-gradient is given by 





(23.33) 


The notation and the metric used here for relativistic quantum mechanics is 
generally the same as is customary in particle physics and quantum field theory, but 
we retain h and c explicitly and employ (unrationalized) Gaussian electromagnetic 
units rather than Heaviside-Lorentz units, as can be seen by the form of the field 
energy, (23.27). (For more discussion of units, see Appendix Section 4.) 

Using the relation V • E = 0, we may transform the integrals (23.26) and (23.29) 

into 


pp — 



E (_) (r, t) ■ d M A <+) (r, t) d 2 r 


(23.34) 


This formula has the usual structure of an additive one-particle operator like (21.62), 
except that the differential operator stands between E (_) and A (+) rather than be- 
tween ^ and i}j. 

The classical expression for the angular momentum of the electromagnetic field 
is 3 

|=-^-frX(EXB)d 3 r (23.35) 

4t rc J 

This may be shown to be the classical limit of the operator equation 

$ = zr~ f S £i _) (r X V)A\ +) d 3 r H — f E ( " ) X A (+) d 3 r (23.36) 

2-7TC J ( = o 27TC J 

Comparison with expression (23.34) for the momentum shows that the first term 
on the right-hand side of (23.36) has the form of an orbital angular momentum 
operator. The second term accounts for the spin of the photons. If the expansions 


3 Jackson (1975), p. 333. 
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3.14) for A {+) and (23.22) for E (_) are substituted, the second term is seen to 
mtribute precisely the value given by Eq. (23.5) to the component of total angular 
omentum along k for each photon momentum state k. The orbital angular mo- 
entum has, as usual, zero projection along the direction of the propagation. 

The photon spin operator may also be written as 

Ml+>\ 

(E^E^E^) S Ay +) d 3 r (23.37) 

U-7 

he 3 X 3 matrices S satisfy the angular momentum commutation relations and have 
genvalues 0 and ±h, thus confirming that photons are particles with spin one. 

Exercise 23.6. Prove the connection between (23.35) and (23.36) by using the 
[entity 

3 

[r X (E X B)L = X E t (x X V) X A { + (EX A) x + V • [E (zA y - yA z ) ] (23.38) 

j=i 

Exercise 23.7. Construct the matrices S in Eq. (23.37) explicitly and verify 
leir commutation relations. 

From the behavior of the various physical quantities associated with the field, 
e infer that the annihilation and creation operators a 2 (k) and af(k) become in the 
lassical limit proportional to the Fourier amplitudes of the electric radiation field 
nearly polarized in one direction, while the operators a 2 (k) and at( k) become pro- 
ortional to the Fourier amplitudes with perpendicular linear polarization. According 
> the relation (23.8), the positive-helicity operators a+(k) and a+(k) correspond in 
le classical limit to the amplitudes of an electric field with left-circular polarization, 
imilarly the negative-helicity operators a_(k) and nt(k) correspond to right- 
ircular polarization in optics. 4 

Exercise 23.8. Show that if viewed by an observer toward whom the light 
?ave is propagating, the real and imaginary parts of the positive-helicity basis vector 

- (e4 1} + ie^e-^ 

^present a counterclockwise rotation of the polarization vector, provided that 

(1) V &(2) _ k 

k A C k — K. 

Exercise 23.9. Work out the equal-time commutation relations between the 
omponents of A(r, t) and E(r, t). 

Interactions with Charged Particles. So far, only free photons have been con- 
idered, and it has been shown that these particles are the quanta of the free elec- 
romagnetic field. With this background, we can now introduce the interaction be- 
ween photons and electrons or other charged particles. Although in many 
pplications it is necessary to use a proper relativistic treatment of the massive 
larticles, a nonrelativistic approximation will be used in this section for describing 
he electron. 


Cf = 

2Trhc . 


/•10'7‘Tl 7 7 
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Following the example established in Section 4.6 for the dynamics of a charged 
particle in an external electromagnetic field, we assume that the Hamiltonian oper- 
ator is obtained from the Hamiltonian for the free particle system by the substitution 

e . 

p — ^ p 4“ — A 

c 

if the particles have charge —e. This prescription leads to the Hamiltonian 
St = ^ J E (-) • E (+) d 3 r + £f^0V + ^A^d 3 r + J d 3 r (23.39) 
for the interacting field operators. 

The first term on the right-hand side of (23.39) refers to the electromagnetic 
radiation field alone, The last term describes the external forces as well as the static 
interactions between the particles, such as the Coulomb repulsion between electrons, 
which has been conveniently separated from the total electromagnetic field for in- 
clusion with the unperturbed Hamiltonian. The middle term, when expanded, con- 
tributes one term that refers solely to the electrons. The remaining terms represent 
explicitly the interaction between radiation and matter fields, which is our main focus 
here. The Hamiltonian is naturally divided into two parts, St — St Q + St x , with St 0 
including the pure radiation and matter terms, while St x symbolizes the interaction 
terms: 

St 0 = ^~ ( E <-) • E (+) d 3 r - — f ijFV 2 i| td 3 r+ f ViJFiJj d 3 r (23.40) 
27 rj 2m J J 

efi C e ^ f 

St i = iJ/A • d?r H — r A • Ai|/'*Ji d 3 r (23.41) 

met J 2 me J 

In deriving (23.40) and (23.41) from (23.39), we have assumed that the Coulomb 
gauge condition (23.18) has been imposed, even if the fields are interacting and 
represent particles that are not free. 

The dynamics of interacting fields, governed by the Hamiltonian (23.39), is 
described by field operators whose structure is far more complicated than the free 
fields introduced in Section 23.1. The theory of interacting quantum fields, with its 
impressive achievements in the high-precision calculations of radiative corrections 
in quantum electrodynamics, lies outside the scope of this book. 5 We restrict our- 
selves to two simplified approaches: problems for which a first-order approximation 
is appropriate, and problems in which the prescribed motion of the charges may be 
treated by classical methods. 

Since the electromagnetic interaction between the radiation field and an electron 
is comparatively weak, the use of perturbation theory is appropriate in many prac- 
tical problems. As an illustration, we will derive the formulas for the intensity of 
emission and absorption of photons by a one-electron atom, neglecting the spin of 
the electron. The operator St 0 describing the free photons and the atom is the un- 
perturbed Hamiltonian, and the interaction St x is the perturbation to which the gen- 
eral methods of Chapter 19 will be applied. 

In first approximation, the first term in the interaction (23.41), which is linear 
in photon creation and annihilation operators, is responsible for transitions in which 
one photon is emitted or absorbed. If the electron is initially in an atomic state 

5 Among many excellent references, we mention Bjorken and Drell (1964 and 1965), Mandl and 
Shaw f'1984'l. and Weinhers ( 19951. 
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jfined by the Schrodinger wave function i/f,( r), the initial state of the electron- 
loton system is, in a somewhat hybrid notation, given by 

I %) = j r)*l» f (r) d 3 r 1 0) el ® | . . . «; A) (k) . . .) (23.42) 

ere j 0 ) e , denotes the no-electron state (electron vacuum), and | . . . /^ A) (k) . . .) sym- 
alizes the state of the electromagnetic field in terms of photon occupation numbers, 
he superscript A specifies the polarization of a photon with wave vector k. In the 
aal state there must be one photon more or one photon less than in the initial state, 
id the electron is annihilated in state i//,(r) and recreated in state <A/( r )- 

In order to evaluate the matrix element of the perturbation term for the transition 
om state i to /, we substitute the expression for the field operator, 

A(r, 0) = \/Airhc 2 2 i — {[a,(k)^ u + a 2 ( k)4 2 V kr 

L k V2 (o k (23.43) 

+ [4(k)4 1) + 4( k)e^]e~ ikr } 

itained from (23.17), into the interaction (23.41). With this interaction, we cal- 
llate the matrix element (^ / |3 f C 1 |'I f ,) for the initial state (23.42) and the final state 

I ¥/> = | <A/( r )^ t (r) d 3 r\0) el ® |. . . , rc< A) ( k) + 1, . . .) ^(23.44) 

he result is 

d. {^FL_} j «> ■ f v* w 

(23.45) 

he upper option in braces applies to absorption of a photon, and the lower option 
fers to emission. If E t and E f denote the initial and final energy of the unperturbed 
om, transitions are appreciable only if the photon energy ha) k — E f — E t or E t — E f , 
spending on whether absorption or emission of a photon takes place. 

Exercise 23.10. Reproduce the derivation of the matrix element (23.45) for 
isorption or emission of a photon by a charged-particle system. 

To illustrate the use of the typical matrix element (23.45) for evaluating tran- 
tion rate with the Golden Rule (19.99), we assume that the photon modes in the 
litial state (23.42) are not selectively occupied. If, as in masers and lasers, the 
xupation number, n h for some modes is large, but small or zero for neighboring 
Lodes, the Golden Rule, which presupposes a weak dependence of the matrix ele- 
ient on the occupation number, may be wholly inappropriate. We already remarked 
i this possibility in Section 19.5. 

Under the usual conditions of observation, the state of the electromagnetic field 
Drresponds to a superposition of occupied photon states in a quasicontinuum, and 
is appropriate to characterize this state by an initial average photon number n) A) ( k) 
>r photons in the polarization mode A and with approximate momentum ftk pointing 
i a solid angle dO k . Since the number of photon modes for each polarization per 
ait energy interval is given by the density of states, 
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the transition probability per unit time is according to the Golden Rule: 


9 7r ry 

w =— p(h(o k ) |<^ / |3fe 1 ['^,)| 2 = <Wfc»i/ ) (k)K/|e ±,kr ei A) • v|i)| 2 dEL k 


(23.47) 


where a = e the is the fine structure constant, and 


i r -ft 

(/ 1 e a±lk ' r efc V) • v|/> = - iA?(r) e ±,kr • - V^-(r) d 3 r (23.48) 
m J i 

is the one-particle matrix element that determines the intensity of the transition. In 
formula (23.47) hi is to be used for absorption and h f for emission of a photon. 

The incident photon flux dl 0 is the number of photons incident on the atomic 
target per unit area and unit time in the frequency interval dto. As explained in 
Section 13.1, 

h ha) 2 dto _ 

dl 0 = —3 chp(hto) da> — — — (23.49) 
L 87 T c 

We define a cross section cr(to) that is sharply peaked at w — ( E f — E^/h [or 
to = (E { — E f )/h] in an interval A cd for photon absorption (or stimulated emission) 
by the relation 

w = f cr((o) dl 0 ~ ^ f cr(to) dto (23.50) 

J dto Jaw 


If (23.47) and (23.49) are substituted in (23.50), the cross section formulas obtained 
in Sections 19.5, and 19.7 by treating the radiation field as classical and prescribed 
are recovered exactly. The rates of transitions of higher multipolarity may be com- 
puted by similar methods. 

Exercise 23.11. Check that the integrated cross section (19.62) is obtained 
from (23.47), (23.49), and (23.50). 


If Hi — 0, formula (23.47) accounts for the rate of spontaneous emission of 
photons in the transition from an excited state of the radiating atom. If the transition 
is assumed to be of the electric dipole type (e !k r ~ 1) and the substitution (19.66) 
is made for the matrix element, the transition rate for spontaneous emission of light 
with a polarization vector e^ A) and frequency to into the solid angle di l k becomes 

w = o) 3 |(/|e^ A) • r | /> | 2 d£l k = -^-5 o) 3 (f • r |/)<i | e£° • r|/> dCl k 

Lire Lire 

The total integrated emission rate 2 w is obtained by summing over the two polar- 
izations and integrating over all angles of emission. Noting that in dyadic notation, 

j [SW + « 2) ] dn k = J [1 - kk] dsi k = Yi (23.5 d 

we obtain 

2 w = co 3 |</|r|0| 2 

4 (23.52) 

= ^ " 3 {K/k • x|/)| 2 + |</|r • y | z>| 2 + K/k • z\i)\ 2 } 
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domic spectroscopy, this expression is for historical reasons often referred to as 
Einstein A coefficient. 

If the initial state of the radiating one-electron atom is characterized by the 
ntum numbers n€m, the expression (23.52) can be summed over the magnetic 
ntum numbers of the final state, using the relations (19.72) and (19.78). Since 

|Ff(x)j 2 + |F?(y)| 2 4- |b?(z)| 2 = ^ (23.53) 

obtain for the spontaneous emission rate from the energy level n€ to n'f± 1, 



(23.54) 


^responding to the selection rule A€ = €'—€= ±1, which is the same here as 
Section 19.5. 


Exercise 23.12. Using the hydrogenic wave functions (12.92), compute the 
ontaneous emission rate for the 2 p Is transition in the H atom. Evaluate its 

ciprocal, the mean lifetime of the 2 p state. 


Elements of Quantum Optics. Proper higher-order perturbation calculations 
i interacting radiation and matter fields, beyond the first-order processes treated in 
e last section, belong in the domain of quantum electrodynamics. However, if the 
atter field that creates and annihilates photons can be approximated by a prescribed 
assical current distribution, the dynamics of the radiation field is greatly simplified, 
id it is not always necessary to resort to perturbation expansions. This strategy is 
iccessful because the free electromagnetic field is representable as a system of 
dependent harmonic oscillators whose interaction with the matter field is linear in 
e photon creation and annihilation operators if the second, quadratic, term in the 
iteraction (23.41) is neglected. Such a semiclassical theory is thus equivalent to a 
astern of forced harmonic oscillators, for which we have already worked out exact 
)Iutions of the dynamical equation in Section 14.6. The theory that treats the matter 
eld classically, but the electromagnetic field as a quantum system, complements 
te approach taken in Chapter 19, where we considered the effect of a classical 
idiation field on an atom, albeit using perturbation theory. 

Here we suppose that the Hamiltonian of the radiation field coupled to a pre- 
:ribed classical current density j(r, t) is obtained from (23.39) by letting the ma- 
jrial carrier of the current become very massive (m — > °°) while keeping j(r, t) 
nite: 

K = ~ f E (_) • E (+) d 3 r - - f j(r, t ) • A(r) d 3 r 
2tt j c J 


(23.55) 
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This expression is transformed in the momentum representation as 
w = X fto> t [flJ(k)ai(k) + ai(k)a 2 (k)] 

k 

- V4rrh ^ — A= j(k, t) ■ [a 1 (k)g^ 1) + 

4 - VM l=j'*(k, 0 • [a{(k)ek ) + a f 2 (k)e ( k 2) ] 

k \2(x) k 

where 

j(k, 0 = pi J j(r, t)e ik r d 3 r 


(23.56) 


(23.57) 


is the Fourier transform of the applied current. 

The Hamiltonian (23.56) is a sum of independent contributions from each pho- 
ton mode with wave vector k and definite polarization. Omitting, for brevity, all 
reference to the particular mode, each addend is of the form 

hcoa^a + f(t)a + / *(r)n t 


which, except for the missing zero-point energy, is the same as the Hamiltonian 
(14.106) for the forced linear harmonic oscillator. The complex-valued function of 
time f(t) stands for 

/(») = - (23.58) 

V ft >k 


Our interest will be focused on the changes produced by a pulsed current distribution 
which is effective only during a finite time interval but vanishes in the distant past 
and the remote future as t ±°°. It is convenient to assume that f(t) # 0 only for 
T 2 > t > Ti > 0. Therefore, the electromagnetic field is free before T j and after T 2 - 
Since the current is assumed to be known and susceptible to a classical descrip- 
tion, each photon mode develops independently in time, and the calculations of 
Section 14.6 become directly applicable. If the electromagnetic field is in its ground 
(or vacuum) state before the onset of the interaction, the final state will be 


S|0) = exp 


f- 

k.A no ) k 


|;'(k, (o k , A)] 



j(k, (o k , A)ai(k) 


| 0 > 

(23.59) 


in the interaction picture. Here we define the Fourier integral, 

;(k, oj k , A) = J_ + J dt I d 3 r e i(Q>t ~ k r) j(r, t) ■ e^ A) (23.60) 


Exercise 23.13. Verify the result (23.59) by factoring S in (14.143) and using 
it for each independent mode. 

The final state (23.59) is a direct product of coherent states — one for each 
excited mode — since for each single mode, 

e“ a+ 1 0) = <?l“l 2/2 | a) 
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e eigenvalues a are proportional to the Fourier transforms of the current density, 
'.60). The state is also an eigenstate of the positive-frequency operators A (+) (r) 
1 E (+) (r). It can be verified that the expectation value of A(r) in the state (23.59) 
identical with the classical retarded vector potential generated, in the Coulomb 
tge, by the prescribed current distribution. This is as it should be according to 
correspondence principle. 

As we saw in Section 14.6 on the forced oscillator, each coherently excited 
>de, with specified k and polarization, is populated by photons according to a 
isson distribution. The successive emissions of photons from a prescribed current 
tribution may, therefore, be regarded as statistically independent events, and this 
navior is relevant in quantum optics and quantum electronics. 

Exercise 23.14. Show that the total probability for emitting from a prescribed 
issical current distribution j(r, t ) a net total number of n photons into a specified 
; R of photon modes, and none into any other mode, is given again by a Poisson 
;tribution. 


P n (R) = e (23.61) 
n\ 

lere h is the mean number of photons emitted into all modes. Use the multinomial 
pansion for the calculation, and evaluate h and r in terms of the Fourier compo- 
nts of the applied current density. 


For large mean photon numbers, coherent states like (23.59) are good approx- 
lations to the states of the electromagnetic field excited by an ideal laser. The most 
aracteristic feature of any coherent state |''P coh ) like (23.59) is the factorization 
operty for the expectation value of any normally ordered product of field operators: 

(^cohl n E^r,, ti ) n E (+) (r_y, r,)l^ coh > 

i J 

= FI <^ coh | E ( ~ ) (r i , h)|^ coh > FI <^coh|E (+) (r;, tj)\V coh ) (23.62) 

i j 

= n E' ( ->(r f , t,) n E' ( ->*(r,., tj) 

i j 

ere the E ,(_) (r, t) are eigenvalues of the field operators, corresponding to the 
)herent eigenstate l^oh) and propagating freely after the interaction has ceased, 
i particular, the energy density of the radiation field for a coherent state is propor- 
onal to 


<^cd,|E < - > (r) • E (+) (r)|* coh) 


< 1 I r coh |E ( - ) (r)|^ coh ) - <^ coh |E (+) (r)|^coh) 
E' (_) (r) • E' (+) (r) 


towing that in a coherent, or quasiclassical, state the field intensity may be cal- 
rlated by treating the expectation values of E (+) and E ( ~ ) like classical fields. 

For a single plane wave mode, the field is according to (23.22) effectively 


E (+) (r, t ) 


i\f2irha> k 


r 3/2 


a A (k)e^ ) e /(kr_ ^ ) 


(23.64) 


his is a single-mode field operator in the Heisenberg picture. By setting t = 0, we 
btain its Schrodinger picture version, assuming the field to be free. 
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Exercise 23.15. Calculate the expectation value of the full electric field 
operator 

E(r, t) = E (+) (r, t) + E { ->(r, t) 

in a single-mode coherent state. Show that the result resembles a classical plane 
wave. Also work out the square root of the variance (or uncertainty) of the field 
amplitude E • e^ t) 1 

Expectation values, like (23.62), of products of field operators for general states 
of the radiation field are significant for characterizing the measurable properties of 
a system of photons. The rate at which photons are usually detected at a point in 
spacetime through absorption by atoms or other systems of charges in the ground 
state is to a good approximation proportional to the squared transition matrix element 

|(^|E <+) (r, f)W| 2 (23.65) 

Here and denote the initial and final states of the radiation field, with 
having one photon less than M*-. An ideal one-photon detector is equally sensitive 
to all photons present at point (r, t) in the initial state. For such a detector, the sum 
of (23.65) over all final states of the radiation field may, by use of the closure 
property for a complete set of states, be written as 

2 |<^ / |E (+) (r, t)|^>| 2 = 2 OP.-lE^r, 0|^/)(^/|E (+) (r, t)\%) 
f f (23.66) 

- (%\E<~Xr, t ) • E (+) (r, t)\%) 

which is the expected field intensity at spacetime point r, t. In the next section, we 
apply these concepts to a quantum mechanical analysis of optical interference phe- 
nomena, and especially the familiar two-slit interference experiment. 

5. Coherence, Interference, and Statistical Properties of the Field. In an inter- 
ference experiment, two (or more) field modes are simultaneously excited, creating 
a state of the photon field as a superposition of several photon-number basis states 
| n u n 2 , . . . , n k , . . .), and the state of the field is probed at various points in space 
or time. We speak of interference when the observed photon detection rate differs 
from the sum of the individual intensities expected for the separate modes, and we 
attribute interference to coherence properties of the system. The term coherence has 
acquired many different meanings in physics, and caution is in order when using it. 
For example, a pure state that is a linear combination of two or more component 
states is sometimes redundantly called a coherent superposition. On the other hand, 
as a technical term we use the name coherent state generally for an eigenstate of an 
annihilation operator and, in the case of the radiation field, specifically an eigenstate 
of E (+) . In this section, the degree of first-order (spacetime) coherence of the field 
will be defined. 

Following the classical treatment of interference, two or more field modes that 
are excited are often thought of as distinct radiation fields that are being superposed. 
In quantum mechanics, such a view can easily and inadvertently lead to the erro- 
neous conclusion that the two or more components of a superposition always rep- 
resent distinct photons. In the interest of conceptual clarity, we regard the radiation 
field as a single spacetime-dependent operator, which is a superposition of all modes 
defined by appropriate boundary conditions, whether or not they are excited. If in- 
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■ference is observed, the particular state as much as the omnipresent field operator 
responsible for its occurrence. 

The art of (photon, neutron, electron, atom, or molecular) interferometry re- 
ires considerable skill to produce the desired superpositions of simple modes, and 
ich effort must go into this first stage of an interference experiment. In a typical 
oung-type) two-slit experiment, two neighboring modes are excited and made to 
terfere with amplitudes whose relative magnitude depends on the widths of the 
ts and whose relative phase is a function of the geometric arrangement (see Figure 
1). Up to this point, the analysis involving the field, with its modes determined by 
e solutions of Maxwell’s equations subject to appropriate boundary conditions, is 
entical with the procedure followed in classical optics. Coherent superpositions of 
;ld modes are also readily produced by many other techniques, such as amplitude 
beam splitters, and nonlinear optical devices that can change the number of pho- 
ns in the state. 

Assuming that the desired photon state has been generated, here we are con- 
:rned with the second stage of an interference experiment, the detection of photons 
a spacetime point. We first consider the common situation of effective two-mode 
;citation and write the relevant basis states as \n x , n 2 ), omitting for brevity all 
ference to the remaining inert modes that are not being observed and may be 
isumed to have zero photon occupation. If just one photon is present in the two 
odes of interest, the normalized state vector has the general form 

|¥> = cos 0| 1, 0) + e 1 * sin 0|O, 1) (23.67) 

he amplitudes cos 9 and sin 9 are determined by the experimental conditions, 
hich define the ^preparation of the state. For example, in a two-slit interference 
5tup with equal slits, 

1 

cos 9 = sm 9 = —7= 

V2 

ad the phase $ is determined by the angle of incidence of the plane-wave photon 
nto the plane of the slits. The relevant part of the field operator is 

E (+) (rt) u 1 (rt)e iSl(xt) a l + u 2 (rt)e iS * r,) a 2 (23.68) 

; is determined by the boundary conditions and the mode frequency, and is a func- 
on of the spacetime point r t at which the field is being probed. For simplicity in 
valuating the intensity, the vector polarization character of the field is ignored on 
le right-hand side and we pretend that the field is a scalar. The amplitudes « 1>2 (rt) 
re real and nonnegative, and the S 1>2 (rf) are real phases. (To emphasize the paral- 
fiism of space and time coordinates in the present context, we omit the comma 
etween r and t. This convention will be continued in the remainder of this chapter.) 
k-gain, we should resist the temptation of thinking of the terms in (23.68) as two 
ifferent radiation fields. 

The observed intensity is, apart from an overall normalization factor, 

f(rt) = (¥|E ( ->(rf) • E (+) (rr)|^> | afa x | ^> + ^|a|a 2 |^) 

+ 2 u x u 2 Re[e l6 ('^|a]'a 2 |' v P)] 

= ((I, 0[ cos 9 + (0, sin 0)[nfaja! + u 2 a\a 2 

+ u l u 2 (e ,s a}a 2 + e~‘ s a 2 a 1 ) ](cos 0|l, 0) + e ,(t> sin 0|O, 1)) 

= cos 2 0(1, 0|a[ai|l, 0) + w 2 sin 2 0(0, l|a|a 2 |0, 1} 

+ U\U 2 sin 29 Ree ,(cl>+S) (\, 0|aja 2 |0, 1) 
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The last term accounts for interference. The relative intensity can be simply written 
in the form: 


I(rt) = 1 + 


U\U 2 

u\ cos 2 6 + u\ sin 2 6 


sin 26 cos (0 4- 8) 


(23.69) 


The quantity 8 = 8 2 - <>! is the relative phase of the two modes that are relevant 
in this intensity measurement. In the conventional two-slit experiment, where owing 
to the method of photon production from a single source both relevant modes have 
the same frequency, the observed intensity is determined by the position of the 
detector that eventually destroys the state. As Figure 1.1 shows schematically; 
S = k(sj — s 2 ) is proportional to the difference of the path lengths, .?! and s 2 , between 
the detector and the two slits. To observe the intensity variations of the phase 4> + 8 
as a function of r t, the experimental arrangement must ensure that ( ufu 2 ) e~ tS varies 
sufficiently slowly in space and time. In a Young-type interference experiment, this 
is achieved by making the distance between the slits small and the distance between 
the slits and the detector large. The preceding analysis shows that in the simple 
architecture considered here a single photon can be said to be responsible for inter- 
ference, reminding us of Dirac’s famous remark that (under single-photon two-slit 
conditions) a photon interferes with itself. 

In general, however, if the state of the radiation field is suitably generated, more 
than one photon is involved in causing interference. For example, if we assume that 
two modes are excited into coherent, or semiclassical, many-photon states: 

l^) = l«i>® |a 2 ) (23.70) 

the intensity is 

I(rt) = («!, a 2 |(M 1 c“' s ‘af + « 2 c“' 52 «|)(M 1 e ,Sl (3 1 + u 2 e lS2 a 2 )\ a u a 2 ) (23.71) 

= M^lail 2 + «1 |o! 2 | 2 + 2« 1 w 2 | aia 2 |cos(S + f>) 

where <f> is now understood to be the phase difference between the coherent state 
eigenvalues and a 2 . If a, and a 2 are thought of as the classical radiation fields, 
formula (23.71) is the familiar result of classical wave theory. 

The intensity pattern is measured at spacetime point r t, but since two modes 
are involved, the interference measurement conveys information about the normal- 
ized correlation functions for the field at two appropriately chosen spacetime ref- 
erence points i*!?! and r 2 t 2 (such as the locations of the two slits). The first-order 
correlation function is defined as the dimensionless quantity, 


8 a \rih; r 2 t 2 ) = 


OPltf-WO • E (+) (r 2 r 2 )|^> 


V^jE^Wi) • E^WOW^lE^W,) • E <+) (r 2 t 2 )\V) 


(23.72) 


In a well-designed interference measurement, the active modes are experimentally 
distinguishable. Bearing in mind that the mode functions are idealizations approxi- 
mating classical wave packets of finite extent, we may suppose that the reference 
points effectively satisfy the separability condition 


Ui(r 2 t 2 ) = u 2 (r 1 t l ) ~ 0 


(23.73) 
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the two-slit experiment, the separate slits are conveniently taken as the reference 
ints in space, and the reference times have the retarded values 

t { — t and t 2 — t (23.74) 

c c 

fectively, the expression (23.68) can then be replaced by 

E <+) (rf) = E ( 1 +) (r 1 t 1 ) + E ( 2 + \r 2 t 2 ) -» M 1 (r 1 f 1 )« ia(ri,l) a 1 + u 2 {r 2 t 2 )e iS(x ^a 2 (23.75) 
Lder the conditions specified for Young-type interference, the interference term in 
; expression for the intensity is seen to be proportional to the normalized corre- 
ion function (23.72) for the field at the two slits at the reference times (23.74). 

The magnitude |g <1) (r 1 t 1 , r 2 f 2 )| is called the degree of first-order coherence of 
5 state of the radiation field. If we set (''P a | = | E c— > (r 1 ? 1 ) and I’P*) = 

+) (r 2 t 2 )l the Schwarz inequality (9.75) shows that 0 ^ |g (1) | < 1. If |g (1) | = 1, 
i field is said to possess first-order coherence. If 0 < |g (1) | < 1, we have partial 
herence, and g (1) = 0 corresponds to an incoherent field. 

If the excitation of all modes is represented by coherent photon states, as in the 
o-mode example (23.70), the factorization property (23.63) shows that the field 
s first-order coherence for all pairs of points in spacetime, justifying the term 
herent for the eigenstates of the annihilation operators. 

Exercise 23.16. Check that the one-photon two-mode states (23.67) are 
st-order coherent for all pairs of spacetime points. 

Exercise 23.17. Prove that the two-mode photon-number basis state | n l , n 2 ), 
th n 12 ¥= 0, has zero first-order coherence at points that satisfy the condition 
3.73).’ 

» 

A more complicated higher-order correlation function arises when we consider 
e joint probability for detecting a photon in each of two detectors at spacetime 
dnts iVj and r 2 r 2 . In lowest order of perturbation theory, the coincidence rate is 
oportional to 

<^|E ( - ) (r 1 ? 1 )E ( ->(r 2 ? 2 )E (+ fe 2 )E (+) (r 1 ? 1 )|^) 
hen the field operators in such a correlation function ard expanded in terms of 
Loton modes, we are always faced with the need to evaluate expectation values of 
irmally ordered products of creation and annihilation operators in equal numbers, 
lese correlation functions measure higher-order degrees of coherence. 

The measurement of second-order (temporal and spatial) photon correlations 
as pioneered by Hanbury Brown and Twiss. 6 For example, in a two-mode config- 
ation they experimentally determined, by photon counting in state )P, the differ- 
ice between the average of the product of the photon numbers, {ty\a\a 2 a 2 ci l \’ i $!'), 
id the product of the averages of the photon numbers for the two modes, 
7 !«{«! |'T f )('^|n|a 2 j^P). If the state of the field is coherent as specified in (23.70), 
ese products are equal and there is no correlation. By introducing the photon cre- 
ion and annihilation operators, 

a' ~ cos 9 a\ + sin de^al, a = cos 6 a x + sin de~ l<l, a 2 (23.76) 
e define, as the generalization of the one-photon state (23.67), an n-photon state 

K): 

|¥„> = A= (a t ) n |0> = —U (cos 9 a\ + sin 9e t +c%r |0, 0) (23.77) 

Vn! Vn! 


6 Loudon (1983), Section 6.4, and Baym (1969, 1981), Chapter 19. 
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Such a state can be produced by beamsplitting techniques from an n-photon state in 
a single mode. Using the binomial expansion, we obtain 


'?„> = 2 


Vnl 


=o k\(n - £)! 


cos* e sin" - * 0 e‘ (n- *^(aj)*(r4) n- *|0> 


(23.78) 


= 2 


n\ 


cos* 6 sin" - * 0 e Kn ~ m k, n - k) 


-o \k\(ri - k)\ 


The probability of finding k photons in mode 1 and n — k photons in mode 2 is thus 
given by the binomial distribution, 


I (k,n - k |^„)| 2 = 


n\ 


cos 2 * 0 sin 2 ^ - ^ 0 


(23.79) 


k\(n — k)\ 

Exercise 23.18, Show that 

(M / „| alail^n) — n cos 2 0 and \c^a x |)P„) - n sin 2 0 (23.80) 


and 


| ala^ai ty n ) — n{n — 1) cos 2 0 sin 2 0 


(23.81) 


From (23.80) and (23.81), the normalized second-order correlation function for the 
state | ty n ) defined by (23.77) is found to be 


= i -- 

| a[a l | | cfa 2 \ ^ n ) n 


(23.82) 


Since this expression equals unity for a coherent state, (23.82) shows the degree to 
which the state |'P‘„) lacks second-order coherence. 

Generally, the state of the radiation field is not a pure state |“'P), but is a m i xture 
described by a density operator p, as defined in Section 15.5. The correlation func- 
tions characterizing the radiation field are then given by the averages 


<E (_) (r 1 t 1 ) . . . E ( -\r n t n )E (+ \r n t n ) . . . E^WO) 

= trace[pE (-) (nb) . - . E (-) (r„rJE (+) (r u rJ . . . E (+) (r 1 t 1 )] 


The definition (23.72) of the two-point correlation function and the corresponding 
degree of first-order coherence must be appropriately amended by using traces to 
express the expectation values. 

As a concrete example (Figure 23.1), we consider light of the same frequency 
emitted from two point sources, A and B, and the detection of photons at points a 
and b. The four channels {Aa), ( Ab ), ( Ba ), and ( Bb ) linking the sources and detectors 
are labeled 1 through 4, respectively, and represent four orthogonal field modes, 
with their creation and annihilation operators, aj and a t (i = 1, 2, 3, 4). Source A is 
assumed to emit single photons into channels 1 and 2 with complex amplitude a. 
Similarly, source B produces photons in channels 3 and 4 with amplitude j3. Thus, 
the sources are assumed to produce the four one-photon states, a<4|0), aa\ |0>, 
J3a|j0), jSa^jO). (In a more general case, the four emission amplitudes could all be 
different.) 

The (unnormalized) density matrix needed to calculate the photon number 
counted by an ideal detector a is, in the basis spanned by states 1 and 3, 


Pa 


/|a| 2 af 3*\ 

U*J8 I/8IV 


(23.84) 
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gure 23.1. Schematic diagram illustrating a spatial Hanbury Brown-Twiss type intensity 
:erference experiment. The two independent light sources are A and B, and the two 
tectors are labeled a and b. The four channels connecting the sources and detectors are 1 
rough 4. 


the sources A and B are statistically independent (“incoherent”) and we plan to 
ilculate ensemble averages, (23.84) must be replaced by the density matrix for the 
isemble. Since the phases of a and (3 are random and uncorrelated, the off-diagonal 
atrix elements of p a average to zero, and we have 


Pa 


/M 2 0 \ 

V o w) 


(23.85) 


he field operator that pertains to detector a is proportional to 


E (+) (a) -» fll + e iSa a 3 


(23.86) 


ere we have assumed, merely for simplicity, that the amplitudes m 1>3 in the notation 
: (23.64) of the two modes are equal in magnitude (and have been absorbed in the 
/erall normalization), but that the relative phase 8 a is sensitively dependent on the 
jometry and the position of detector a, just as in classical optics. As before, we 
so ignore the vector polarization properties of the field. The 2X2 matrix repre- 
ssing the intensity operator at a is 

E <->(a) • E (+) (a) -► ( l j (23.87) 


ombining (23.85) with (23.87), we obtain for the suitably normalized average 
density: 


<4> 


= (E (-) 
— trace 


(a) • E (+) (a)> 

7l«l 2 0 \ 

LI o 1/317 




= | a\ 2 + |/3| 2 


(23.88) 


'hich is scarcely a surprising result. Under the simplifying assumptions we have 
lade, the average intensity (or photon detection rate) at detector b is the same: 

4> = <4>- 

We now extend the calculation to the less trivial evaluation of the second-order 
itensity correlation function, or average coincidence rate, for the two detectors: 


( 44 ) = (E ( -\a)E^\b)E (+ \b)E (+ \a)) (23.89) 


ince we are interested in the state that represents the presence of one photon at 
ach of the two detectors, the four relevant two-photon states are: 

a 2 < 22 <zi| 0 ), (3 2 aXa\\to ), a(3ala\ 1 0), a^a\a\\Q) 
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The (unnormalized) density matrix for the statistical ensemble (assuming indepen- 
dent sources) in this representation is 


4 


Pab 


a 4 

0 

0 

0 

0 

l/3| 4 

0 

0 

0 

0 

wm 2 

M 2 I/3| : 

0 

0 

wm 2 

M 2 |0| : 


(23.90) 


The field operator pertaining to detector b is analogous to (23.86): 


E (+) (fi) -» a 2 + e iS »a A 


(23.91) 


We use the operators (23.86) and (23.91) to construct the matrix representing the 
correlation operator for substitution in (23.83): 


E ( ~\a)E^\b)E (+ \b)E (+ \a) 


l 

e KS a +8 p ) 

e iS » 

e iS ° 

— i(8 a +8 h ) 

1 

e -iS a e ~iS b 

e~ iSb 

e iS ° 

1 

e i(S a -/3 b ) 

g- lS a 

e iSb 

e -i(8 a -8 b ) 

1 


\ 

/ 


(23.92) 


The trace of the product of the matrices (23.90) and (23.92) is the correlation func- 
tion (23.83): 


<44> = l«r + |]8| 4 + 2M 2 |j 8| 2 [1 + cos(S a - 8 b )] 
= (4X4) + 2H 2 |/3| 2 cos (8 a - 8 b ) 


(23.93) 


Here the intensity correlation is seen to depend on the phase difference 8 a — 8 b , 
which is determined by the wavelength and by the geometry of the source-detector 
configuration (Figure 23.1). The result (23.93) forms the basis for photon counting 
and intensity interferometry and is generally applicable to boson systems. 

Exercise 23.19. Check the correlation matrix (23.92). 


A particularly important mixed state is the state of thermal equilibrium, which 
is a special case of a chaotic state. A chaotic state is defined as the state of minimum 
available information or maximum randomness and entropy, subject to the constraint 
that it must reproduce a given value (N t ) for the mean number of photons occupying 
the mode i. The theory of Section 22.5 may be applied to the calculation of the von 
Neumann entropy S for each mode, but now there is a Lagrangian multiplier A, for 
each mode. Since we are considering only a single mode here, we temporarily omit 
all indices identifying the mode. Following the same arguments as before, we max- 
imize 


S — k\(N) - — k trac e(p In p) — k\(N) 
and obtain for the chaotic state the density operator 

A ctfa 



(23.94) 


(23.95) 


with the partition function, 

. Z = trace e _Aat “ = 2 ( n\e~ Xafa \n ) — - 


(23.96) 
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d the bosonic constraint 



(23.97) 


<r A - 


(N) 


(N) + 1 

tice (N) > 0, relation (23.98) requires that A > 0. 


(23.98) 


Exercise 23.20. Prove that the partition function (23.96) for photons is 

Z = (N) + 1 (23.99) 

'aluate the entropy as a function of (N). 


The density operator for a chaotic photon state may thus be expressed as 

(N) N 


m + 1 ) 


AM-1 


(23.100) 


Exercise 23.21. Show that 

P = 77K 2 <r A ’l»X»l = -j- f a) <a| (23.101) 

, \N) n= 0 TT\J\) J 

e equivalent representations for the chaotic-state density operator in terms of 
lOton-number and coherent states, respectively. 


Exercise 23.22. Prove that any pure or mixed single-mode state, represented 
a density operator p, has first-order coherence at all pairs of spacetime points. 


If a radiation field is in thermal equilibrium, as in a black-body cavity, all field 
ades (now labeled by the index i) have the same temperature, which is determined 
r the average total energy of the system. 

(H) = X fiaiiiajai) 


i thermodynamic grounds, we argue as in Section 22.5 that in thermal equilibrium 
temperature T the Langrangian multiplier for each mode must be identified with 


A, = 


h(x>i 

Tr 


(23.102) 


ibstituting this value in (23.97), we find that the average number of photons in the 
ode (k, A) in thermal equilibrium is 


(N k ) 


1 

e ha> k /kT _ | 


(23.103) 


lis is Planck’s distribution law for black-body radiation, from which the quantum 
eory took its start. 
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Problems 

1. Consider a system of photons with fixed propagation vector in the positive z direction. 
The one-photon basis states are linear polarization states in x and y directions, al | 0 ) 
and a*|0). 

(a) If a[ and al are the creation operators for positive and negative helicity pho- 
tons, respectively, express the photon spin operator 

‘ S z = h{ala L - a\a R ) 

in terms of the creation and annihilation operators for linear polarization. 

(b) Work out the commutation relations of S z with a\ and aj, and compare them 
with the commutation relations for the components of a vector operator. Show that 
S z commutes with the number-of-photons operator. 

(c) Construct the general two-photon state for this system. Derive the two- 
photon eigenstates of S z , and interpret them in terms of the helicity of the photons. 

(d) For an ensemble of unpolarized two-photon states, obtain the density matrix. 
How would the density matrix look if the two photons were distinguishable? 

2. Prove that in a hydrogen atom the radiative transition from the 2 S excited to the IS 
ground state cannot occur by emission of one photon. Outline (but do not attempt to 
carry through in detail) the calculation of the transition rate for two-photon emission. 
In the dipole approximation, show that the two photons are preferentially emitted in 
the same direction or in opposite directions and that the angular correlation function 
is proportional to 1 + cos 2 6, if 9 is the angle between the photon momenta and if 
the polarization of the emitted light is not observed. Estimate the order of magnitude 
of the lifetime of the metastable 2 S state. 

3. Evaluate the peak value of the cross section for electric dipole absorption by a linear 
harmonic oscillator in its ground state, assuming that the excited state is depleted 
only by spontaneous emission. Use Eq. (19.134). 

4 . Compute the total rate of photon emission from the initial n = 3 level of a hydrogen 
atom to the n — 2 level (B aimer alpha line) and the mean lifetime of the atom, 
assuming that initially the substates of the n — 3 level are equally populated. 



CHAPTER 24 


elativistic Electron Theory 


In this final chapter, we develop a condensed account of the relativistic 
theory of electrons, or other fermions with spin one-half and mass m. In 
the interaction with the electromagnetic field, positrons of charge e > 0 
are created simultaneously with electrons of charge —e. Turning history 
upside down, we begin with a consistent unified description of both 
particles and antiparticles in terms of a common electron-positron field, 
based on the free-particle Dirac equation. We extend the global 
symmetries of Chapter 17 to the relativistic theory and consider the 
discrete symmetries (spatial reflection, time reversal, and charge 
conjugation) and their interconnection. In a one-particle approximation to 
the quantum field theory, the Dirac equation for a (four-component) 
spinor wave function in an external electromagnetic field is seen to fit into 
the standard scheme of ordinary quantum mechanics. In the nonrelativistic 
limit, the Dirac equation reduces to the Schrodinger equation. The story 
ends with the relativistic theory of the fine structure of the spectrum of 
the hydrogenic atom. 

The Electron-Positron Field. A free relativistic electron or positron is char- 
erized by its linear momentum p and energy E p , which are related by 

E p = Vc 2 p 2 + (me 2 ) 2 (24.1) 

in the case of the photon, only a measurement of the component of the particle’ s 
n in the direction of the momentum is compatible with a sharp energy-momentum 
Tor. Hence, the electron or positron may have definite positive ( R ) or negative 
helicity. 1 

We introduce creation and annihilation operators for electrons (a t and a) and 
fitrons ( b 1 and b ) in the two helicity states, subject to the anticommutation 
ations: 

0*(P)4(P') + 4(P')«/?(P) = a L (p)al(p') + a[(p'K(p) 

= Mp)6kp') + H(p')Mp) (24.2) 

= b L (p)bl(p') + bl(p'yb L (p) = 5(p - p') 

[ other anticommutators of these eight operators are set equal to zero, partly as a 
^sequence of the fermion theory developed in Chapter 21, and partly (namely, for 
icommutators of a or a T with b or fi f ) as an assumption that will be seen to be 
isistent with the formulation of a unified electron-positron theory. 2 


1 Warning: In optics it is conventional to define positive (negative) helicity as left (right) circular 
arization of light. See Exercise 23.8. 

2 In Chapter 23, the photon momentum was restricted to discrete values by the imposition of 
iodic boundary conditions on the radiation field. For the electron-positron field, it is convenient to 
L — » oo from the beginning and allow all of momentum space for p. 
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The operators for the energy, linear momentum, and charge of a system of free 
electrons and positrons are easily written down: 

W = J E p [al(p)a R (p) + at(p)a L (p) + b f R (p)b R (p) + 4(p)4(p)] d 3 p (24.3) 

P = f P[4(PK(P) + al(p)a L (p) + 4(P)4(P) + b[(p)b L (p)] d 3 p (24.4) 

Q = e J L^4(p)«/?(p) - 4(pK(p) + bl(p)b R (p) + bl(p)b L (p)] d 3 p (24.5) 

Also, if the operator $ represents the total angular momentum of the electron- 
positron system, 

[$ • p, 4 (pXK(p) + \$ • p, 4 (p)!Mp) + [$■$, 4(p)]4(p) + [$■$, 4(p)]4(p) 

fl 

= 2 W R {p)a R (p) ~ 4(pK(p) + 4(p)4(p) - 4(p)4(p)] 

(24.6) 

is the spin component along the direction of the particle momentum p per unit 
volume in momentum space. 

The objective of local quantum field theory is to seek ways of expressing these 
physical quantities as volume integrals of local (density) operators so that the op- 
erators for the total energy, momentum, charge, and other additive physical quan- 
tities eventually appear in the form 

% = f ilt f (r)K*lf(r) d 3 r (24.7) 

where K is an appropriate one-particle operator. The field operators *]/(r) are again 
distinguished in the notation from their wave function relatives i/f(r). As explained 
in Chapter 21, they are defined in the usual manner as Fourier integrals for a trans- 
formation of the creation operators from the momentum to the coordinate basis. 
However, care is required in the construction of the Fourier coefficients as well as 
in the choice of the one-particle operators representing physical quantities. 

For example, it is formally possible to write the energy of the system of free 
electrons and positrons as 

3C = J i4(r)V-ftW 2 + (mc 2 ) 2 i Ji(r) d 3 r 

Such a choice was seen to be unsatisfactory in the case of photons because it implies 
a nonlocal expression for the energy density. For photons, this impasse led to the 
inference that a reasonable definition of a one-photon probability density in ordinary 
space cannot be given. 

In the case of relativi stic particles with mass, the same conclusion holds, al- 
though the expansion of \ /r —h 2 c 2 V 2 + (me 2 ) 2 in powers of V 2 shows that the non- 
local effects, which arise from the presence of arbitrarily high derivatives, disappear 
in the nonrelativistic approximation. 3 The goal of formulating a strictly relativistic 
one-particle theory is unattainable. 


3 The Foldy-Wouthuysen version of relativistic electron theory is based on the use of the square 
root operator for the energy, but it can be put in local form only by successive approximations. See 
Rose (1961), Schweber (1961), and Gross (1993), Section 5.7. 



4 


Chapter 24 Relativistic Electron Theory 


It is possible to produce a sensible field theory for particles with mass along 
nilar lines as was done for photons, and this is customarily done for bosons such 
pions (spin zero). Dirac’s discovery of the relativistic theory for electrons (and 
sitrons) showed that the field theory for fermions with spin one-half may be de- 
loped in a form that is strongly reminiscent of one-particle quantum mechanics, 
straightforward relativistic one-particle approximation thus becomes feasible for 
ch particles, and we will develop it in Section 24.6. 

In the language of quantum field theory, the essence of Dirac’s discovery is the 
nervation that the physical quantities (24.3)-(24.5) may be reexpressed in alter- 
te form by the use of the anticommutation relations and some simple changes of 
riables of integration, resulting in 


W = J E p [al(p)a R (p) + al(p)a L (p) ~ b R (-p)bl(-p) 

(24.8) 

- b L (-p)bt(-p) + C] d 3 p 


P = J p[4(pK(p) + al(p)a L (p) + b R (-p)bl(-p) 

(24.9) 

+ b L (-p)bi(~p) + C] d 3 p 


Q = e J [-4(p)«/?(p) “ al(p)a L (p) - b R (-p)b f R (~p) 

(24.10) 

~ b L (-p)bl(-p) + C] d 3 p 



[$ • p> 4(p)]a*(p) + \$ • p, 4 (p)]«l(p) - [$ • p, 4(-p)14(-p) 

- 1$ • p, 4(-p)]4(-p) = ^ [4 (p)«*(p) - 4(pK(p) 

- h*(-p)4(-p) + M-p)4(-p)] 

(24.11) 

If we momentarily disregard the constant terms symbolized by C, these ex- 
essions show that the annihilation operator for a positron, b L (— p), can also be 
terpreted as an operator creating an electron of momentum p but positive helicity 
Ld negative energy, — E p . Such negative energies appear quite naturally in a rela- 
/istic theory that relates energy and momentum by the equation 

El = c 2 p 2 + (me 2 ) 2 (24.12) 

lowing in addition to Eq. (24.1) the solution — Vc 2 p 2 + (me 2 ) 2 . 

Using these clues, we construct a field operator 

i|/(r) - »J/ +) (r) + i|» (_) (r) (24.13) 

the sum of positive and negative frequency (energy) parts defined as 

* <+)(r) ^ / [« ( >K(P) + w aj (P)«z.(P)]4 M)pr d 3 p (24.14) 

^"V) = o l V 3/2 [ [^ w (-p)4(p) + v (L \-p)bl(p)]e~< mvr d 3 p 

(2irh) J (24.15) 

= / b ( »4(-p) + v {L \p)bt(-p)]e^ mpr d 3 p 
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The coefficients u w ( p), n a) (p), i/^p), and */ (i) (p) are one-column matrices 
that must be orthogonal to each other, such that for a fixed momentum p, 


u <R) 'u (L) 


(u (R) * . . .) 




= 0 



and similarly, 

u (R)t u (L) = u (R)i v (R) = u <R)i V (L) = u (L)i p (R) = w (i >V L) = v (R)t v (L> = 0 (24.16) 


Generalizing the terminology introduced in Section 16.3, we call these one-column 
matrices, with an as yet unspecified number of rows, spinors, and Dirac spinors on 
occasion when it is essential to avoid confusion with the two-component matrices 
of Chapter 16. We assume these spinors to be normalized according to the relations 4 


u <Rn u (R) = u <L)f u (L> = v (Ry] v <R) = V (m v (L) = 1 


(24.17) 


The field operators i|i(r) and »^ + (r) are similarly spinors, with components car- 
rying a yet to be determined number of spin indices. If such spinors can be found, 
the total linear momentum and the total charge of the system can be written as 


P - *|/ f (r) - Vi}/(r) d 

J i 


Q 


__ e f 
2 . 


[^(rMr) — *Kr)i|i + (r)] d 3 r 


(24.18) 

(24.19) 


where the symbol ~ indicates matrix transposition. 

Three comments are in order: (1) The formula (24.18) represents the expression 
(24.9), but we omit the constant term in the integrand, which merely ensures that 
the vacuum has zero momentum and is not needed if all momenta are measured 
relative to the vacuum. (2) Equation (24.18) for the linear momentum has the same 
form in the relativistic as in the nonrelativistic theory because, as indicated by Eq. 
(2.28), ( h/i ) V represents the three spatial components of a relativistic four-vector. 
Angular momentum is made relativistic in a similar straightforward manner (Section 
24.3). (3) The charge operator (24.5) was constructed to have zero expectation value 
in the vacuum, defined as the state in which there are no electrons and no positrons 
with positive energy. The peculiar form of Eq. (24.19) arises from rewriting (24.5) 
more symmetrically as 


Q = “ j [a£(p)a*(p) “ «*(p)4(p) + 4 (p)«l(p) - «l(p)4(p) 

- 4(p)4(p) + 4 (p)4(p) - 4(p)4(p) + Mp)4(p)1 d 3 P 


If we start from (24.10), the charge operator can be expressed in terms of the field 
as 


Q = -e j »Ji t (r)iJ/(r) d 3 r + e j <0|»|i + (r)^(r)|0) d 3 r (24.20) 


4 Warning : A variety of different normalizations for Dirac spinors are current in the literature. 
The main reason for making a different choice is that one often prefers a Lorentz-invariant normaliza- 
tion, which (24.17) is not. 
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quantum field theory, it is advantageous to define a normal-ordered, product or 
:k product : . . . : of the two fermion field operators ijf(r) and iff + (r') such that it 
i vanishing vacuum expectation value: 5 

-:iKr>|/ + (r'): = :^ + (r'>Kr): = *V>Kr) - <0|^(r')^(r)|0) (24.21) 

ere the minus sign is due to the anticommutation relations. With this notation, 
write the total charge operator simply as 

Q= ~e j 4 + (r)iKr): d 3 r (24.22) 

3 equivalence of (24.19) and (24.20) can be directly established after the anti- 
nmutation relations for the field are obtained in the next section. The compact 
mula (24.22) shows how close we can come to our stated objective of expressing 
iitive physical quantities as integrals over local densities. Except for the appear- 
:e of the normal-ordered product, which is an essential feature of relativistic the- 
-, (24.22) indeed looks like (24.7). 

Exercise 24.1. Show that the vacuum expectation value (0 1 t|/ t (r)iji(r) |0) is 
mite. 


The Dirac Equation , It remains to show that the energy of the system can also 
written in the form 

»Ji t (r) H i|i(r) d 3 r + const. (24.23) 



bstitution of the fields (24.14) and (24.15) in this integral shows that this goal 
l be accomplished if we require that 

Hu (R ' L \ p)e (l/fi)p r - E^X p)e 0/ft)p r (24.24) 

Hv (R ' L Xp)e mi)pr = ~E p v (R ’ L X p)c (i/ft)p r . (24.25) 

(24.23) is to be an integral over a localized energy density, the requirements of 
rentz invariance make it mandatory to seek a Hamiltonian that is linear in the 
ferential operator V. Therefore, we attempt to construct H in the form 


ft 

H — ctx • — V + fimc 1 

i 


(24.26) 


ving the constant square matrices a x , a y , a z , and /3 as yet undetermined. With 
s choice for H, Eqs. (24.24) and (24.25) reduce to 

(cot • p + pmc> ( *’ L) (p) = E p u (R ' l X p) (24.27) 

(cot ■ p + pmc 2 )v (R ' L) (p) = -E p p (R ’ l X p) (24.28) 

ice the eigenvalues ±E P are real and the eigenspinors orthogonal, the operator- 
itrix 


H p = cot • p + j 8mc 2 


(24.29) 


5 See Mandl and Shaw (1984). 
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must be Hermitian. Thus, a and (3 are four Hermitian matrices. They must be at 
least four dimensional (four rows and four columns) if H p is to have four orthogonal 
eigenspinors, and they should be no more than four dimensional if the description 
of electrons and positrons in terms of momentum, energy, and helicity is complete. 

Since the eigenvalues of H p are to be E p and ~E p , with each of these being 
doubly degenerate, all four eigenvalues of ( H p ) 2 must be equal to E 2 , hence 
(Hp) 2 — E p 1 and Trace H p — 0 is required. If we take the square of (24.26) and use 
the relation (24.12), we thus obtain the conditions 

= ot 2 y = a\ = p 2 = 1 

a x a y + a y a z = a y a z + a z a y — a z a x + a x a x 

- a xP + - Oiyfi + P<x y = « z /3 + (3ot z 

Our problem thus reduces to a purely algebraic one of finding four-dimensional 
Hermitian matrices with the properties (24.30) and (24.31). Pauli proved that all 
matrix solutions to these equations for a and are reducible by unitary transfor- 
mation to one another. Hence, it is sufficient to determine one particular 4X4 
solution and show that all traces vanish. 

Exercise 24.2. Using only the conditions (24.30) and (24.31), prove that the 
trace of a x , a y , and a z , and (3 vanishes, and show that each of these matrices has n 
eigenvalues +1 and n eigenvalues —1, where 2 n is the dimension of the matrices. 

Exercise 24.3. From (24.30) and (24.31), prove that a and are at least four 
dimensional. 


(24.30) 

(24.31) 


The most widely used representation of the a and (3 matrices are the 4 X 4 
matrices specified in terms of the 2X2 Pauli matrices of Section 16.4: 





(24.32) 


Every element in these 2X2 matrices is itself to be understood as a 2 X 2 matrix, 
so the matrices a and (3 are 4X4. We refer to (24.32) as the standard represen- 
tation. 

Exercise 24.4. Verify the validity of the solutions (24.32) to the problem 
posed by conditions (24.30) and (24.31). 

The discussion of this section so far leaves unidentified the Hermitian matrix 
that represents the helicity. Such a matrix must commute with H p and distinguish, 
by its eigenvalues, the two helicity states R and L. It will be readily identified after 
the angular momentum operator is obtained (Sections 24.3 and 24.4). 

The anticommutation relations for the field operators can now be derived from 
Eq. (24.2) and the remarks following this equation. The four eigenspinors M w (p), 
w (i) (p)> v (i?) (p), and ^ (Z,) (p) of the 4 X 4 matrix H p are orthonormal. Hence, they 
form a complete set of spinors, and the closure relation 
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ds. Using this relation, we can easily verify that 

(24.34) 


Exercise 24.5. Verify Eqs. (24.34). 

Exercise 24.6. Using (24.18) and (24.34), prove that 

W*(r), P] = Vv|*(r) (24.35) 

lich is the spatial companion of the Heisenberg equation of motion for the field 
erator. 

From the equations of motion for the creation and annihilation operators, the 
ae development of the free Dirac field is deduced by use of the Hamiltonian (24.3). 
e obtain in the Heisenberg picture 

4» (+) (r, t) = J [w w (p)a R (p) + M a) (p)u L (p)]e ( ' /ftXp ' r_ V> d 3 p (24.36) 

* ( ~ )(r ’ 0 = (dhf 2 j [yW( P ) ^ ( ~P ) + ^ i) (P)^(-P)]^ )(pr+£pf) d3 P (24-37) 

Eqs. (24.24), (24.25), and (24.26) are applied, we see that both frequency com- 
ments of »|f and the total field itself satisfy the field equation 

(24.38) 

tiis equation, which is the analogue of the time-dependent Schrodinger equation of 
anrelativistic quantum mechanics and of Maxwell’s equations for the electromag - 
jtic field, is known as the Dirac equation of the electron. The Dirac equation 
L4.38) can be cast in a more appealing form, particularly suitable for discussion of 
orentz covariance, by the introduction of a new set of 4 X 4 matrices, known as 
le Dirac y matrices: 

y° = jS, y 1 = pa x y 2 = poty, y 3 = P<* z (24.39) 

Exercise 24.7. Show that the three “spatial” matrices, y 1 , y 2 , y 3 , are anti- 
ermitian. In a transparent notation: y + = ~y. 

Using relativistic notation, the metric introduced in Section 23.2 and the sum- 
lation convention, with Greek indices running from 0 to 3, we may rewrite Eq. 
24.38) in the compact form 

(24.40) 

Ve have abbreviated the inverse of the Compton wavelength of the electron as 

.. — 
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and denoted 


r b = y0 i? + r ' v = 


1 d 

P~ - + P°L -V 
c dt 


The conditions (24.30) and (24.31) may be summarized as anticommutation relations 
for the y matrices: 


y'V + y v y* = 2 g» v 1 


(24.41) 


The one-particle differential operator that represents energy-momentum is given by 


P = 

= 



3 h _\ 
— T V 
c dt i ) 



For electrons with charge q — — e (e > 0), the presence of an external electromag- 
netic field, acting on the matter field, is as usual taken into account by the replace- 
ment 


p* -» p* + - A* 

c 

This gauge principle defines a minimal interaction of the Dirac spinor field with the 
vector field 


A* = (<f>. A) and A M = (0, -A) 

The substitution changes the Dirac equation from its free field form into 

id ie \ 

A^-^r + iK * =1> 


or in the noncovariant form, analogous to Eq. (24.38), 


ih 


O ' 

dt 


COL 


' i. N 

n _ e . 

— v -j — x 

, i c J 


ecf> + ftmc 2 


i|i(r, t) 


(24.42) 


(24.43) 


It is useful to define an adjoint Dirac field operator by the relation 

Vj, = 

Since y is antihermitian and y° Hermitian, Hermitian conjugation of Eq. (24.42) 
and multiplication on the right by y° leads to 


( d ie \ — — 

(i? + Tc a ») ~ iK,>> = 0 


(24.44) 


If this equation is multiplied on the right by i|/ and Eq. (24.42) on the left by i|i, and 
if the resulting equations are added to one another, the continuity equation 


— (r|ry^) = 0 


(24.45) 
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obtained. Similarly, it is easy to prove the further continuity equation 

~ (*j/y^f) = 0 (24.46) 

>mparing these expressions with the total charge operator (24.19) or (24.22), we 
fer that the electric current density four-vector of the electron-positron system is 
fined by 


F = ( cp,j ) = -— (i|/y M i|/ - ^r M tl/) = ~ec:ty y^: (24.47) 

lis operator is often simply referred to as the four-current. Conservation of charge 
ensured by the continuity Eqs. (24.45) and (24.46), or 

dr 

— = 0 (24.48) 

dx» 

nlike the nonrelativistic current density, which explicitly depends on the vector 
)tential A (see Exercise 4.17), A does not appear in the definition (24.47). Of 
>urse, it affects the current indirectly, since the field operator is a solution of the 
irac equation (24.42), which includes the electromagnetic potential. We will relate 
e relativistic formulation to the nonrelativistic limit in Section 24.8. 

Exercise 24.8. Derive the continuity equations. Show that the current is a 
ermitian operator, and, using the anticommutation relations (24.34), verify the 
quality of the two expressions for the conserved current in (24.47). 

. Relativistic Invariance. Unlike the relativistic invariance of Maxwell’s equa- 
ons for the free radiation field, even in quantized form, which needs no proof since 
le Lorentz transformations were designed to accomplish just this purpose, it is 
ecessary to demonstrate that the Dirac theory is in consonance with the demands 
f special relativity. Specifically, the requirement of invariance of the theory under 
lhomogeneous Lorentz (or Poincare) transformations will serve as a guide in es- 
iblishing the transformation properties of the electron-positron field. The general 
leory of the irreducible representations of the Lorentz group contains all the rele- 
ant information, but if nothing more than the transformation properties of a special 
eld is desired, the mathematical structure may be deduced from simple physical 
onsiderations. 

Einstein’s restricted principle of special relativity postulates the equivalence of 
hysical systems that are obtained from each other by geometrical translation or 
otation or that differ from one another only by being in uniform relative motion, 
according to Section 17.1, such equivalent systems can be connected by a unitary 
ransformation of the respective state vectors. 

The principle of relativity is implemented by constructing the coordinate trans- 
ormation 


x'* = aT v x v + b M (24.49) 

vith real coefficients aT v and subject to the orthogonality condition 


dx'^dx'^ = dx v dx v or aT x aF = 81 


(24.50) 
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In addition to the proper orthochronous Lorentz transformation for which 

det = 1, a° 0 ^ 1 

the orthogonality condition allows improper Lorentz transformations such as space 
reflections and time reversal, as well as combinations of these with proper ortho- 
chronous transformations. Although there is no compelling reason to expect that the 
coverage of the principle of relativity extends to the improper Lorentz transforma- 
tions and those feversing the sense of time, it is important to investigate whether 
the proposed theory is invariant under the totality of the transformations licensed 
by the orthogonality condition (24.50). 

It is a fundamental assertion of local quantum field theory that if an active 
Lorentz transformation takes the point (r, 0 into (r', t ') and changes the state T' 
into a state t/\P, where U is unitary, the components of i|/(r', t')Uty must be related 
by a linear transformation to the components of tAJ»(r, 0^- Hence, the field must 
transform as 


i|>(r', = SlA|f(r, 

or 


[/^(r', t')U = S»Kr, t) (24.51) 

The 4X4 matrix S defines the geometrical transformation properties of the spinor 
whose components, like those of a vector or tensor, are reshuffled in this symmetry 
operation. [Compare (24.51) to Eq. (17.83).] It is assumed that the vacuum state is 
left unchanged by a symmetry transformation: {7\P' (0) = 

We first consider three-dimensional rotations as a subgroup of the Lorentz trans- 
formations. From the definition of rotations, it follows that we must expect the 
relations 

Ua R , L ( p) = a R Jp')U, Ub RJ ,(p) = b RiL (p')U (24.52) 

to hold, with p ' being the momentum vector that is obtained from p by the rotation. 
Since p • r = p ' ■ r ' and since the integral over the entire momentum space is in- 
variant under rotations, it follows from Eqs. (24.13), (24.14), (24.15), and (24.52) 
that condition (24.51) will be satisfied if we determine the matrix S such that 

u (R - L \p’) = Su (R ' L) (p) (24.53) 

z/*' L) (p') = Sv (R ’ L \p) (24.54) 

Since E p is invariant under rotations, the last two equations in conjunction with 
(24.27) and (24.28) imply the condition 

H p -S = SHp (24.55) 

where 

H p = ca • p + j 8mc 2 = /3 icy ■ p + me 2 ) 

If we write 

P ,k = a k jp j , p k = p\a\ 

with summations over repeated Latin indices extending from 1 to 3 only, substitution 
into (24.55) produces the conditions 

/3/S = SjByV* 


j3S = S/S 



2 


Chapter 24 Relativistic Electron Theory 


d 


a e k7 k = s-'yts 


(24.56) 


7° = S _1 7°S (24.57) 

The conditions (24.56) and (24.57) for the matrix S are included as special 
ses in the general condition that S must satisfy if the electron-positron field theory 
to be invariant under all (homogeneous) Lorentz transformations: 


= s- VS 


(24.58) 


though this condition may be obtained by generalizing the argument that we have 
ven for spatial rotations as active transformations, it is easier to derive it by taking 
3 passive point of view and requiring that the Dirac equation (24.40) must be 
variant under the transformation: 


d 


x'» = aE v x v , 


d 

dx lx 


dx 


tv a V- 


(24.59) 


*[»'(«•', t') = S»Jr(r, /) 


(24.60) 


Exercise 24.9. Derive condition (24.58) from the Lorentz invariance of the 
irac equation. 

# 

The demonstration of the Lorentz invariance of the theory will be complete if 
e matrix S can be exhibited for each possible Lorentz transformation. The explicit 
instruction of S for proper orthochronous Lorentz transformations, which can be 
stained continuously from the identity operation, is most easily accomplished by 
msidering the condition (24.58) in an infinitesimal neighborhood of the identity. 
r e may write 

a» v = S\ + s» v (24.61) 

ith the condition 8% = — sfi as an immediate consequence of the orthogonality 
mdition (24.50). For the case of spatial rotations, see Exercises 16.3 and 17.21. 

If an arbitrary Lorentz transformation represented by S is followed by an infin- 
esimal one, the composite transformation is represented by S + dS, and the infin- 
esimal transformation is represented by 

(S + dS)S~ l = 1 + dS • S" 1 
.pplying Eq. (24.61) to Eq. (24.58), we get 

(i - dS • S -1 )y A (i + dS ■ S -1 ) = y A + e A ,V 


r 


[y\ dS ■ S _1 ] = for all A 

he solution of this commutation relation is seen to be 

dS ■ S -1 = ~ 


(24.62) 


(24.63) 
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A three-dimensional rotation by an angle 8<p about an axis along the unit vector ii 
takes the position vector r, according to (11.10), into 

r' = r + 8(f) n X r (24.64) 

By comparison with Eqs. (24.59) and (24.61), the identification 

S 2 “ — S l — ~ 8(f) «3 = ~ £ 12 = £21 

4 e 2 3 = — £ 3 2 = ~8<f> Hi = -e 23 = £32 (24.65) 

£ 3 i = — £* 3 = —8(f> n 2 — — £31 = £13 

emerges. 

Exercise 24.10. Check one of the three equations (24.65). 

If we substitute the infinitesimal displacements in (24.63) and define the 
matrix 2 , 

2^ = ~ [y M , yl (24.66) 

Eq. (24.63) reduces to 

dS ■ S -1 = — ^ 8(f> (n{Z 23 + n 2 2 31 + « 3 2 12 )] 

This matrix differential equation has the simple unitary solution 

S = exp — ^ 4>{n{2r 3 + n z 2 31 + n 3 2 12 )] — (24.67) 

We have used the notation 

2, - 2 23 , 2 y - 2 31 , 2 Z = 2 12 (24.68) 

for the four-dimensional analogues of the Pauli spin matrices. 


Exercise 24.11. Show that the 4 X 4 matrices 2 defined by (24.66) and (24.68) 
satisfy the usual commutation relations for Pauli spin matrices. Show that in the 
standard representation (24.32), 


2 - 


ar 0 
0 a 


(24.69) 


If (24.64) is substituted into (24.51) and the integration over the rotation angle 
(f) is performed, the behavior of the spinor field under finite rotations (with t’ = t) 
is obtained: 


[/^(r, t)U — exp — i(f) n ■ X -7 V + ^ rjr(r, t) 


— exp 


- ( L + |s)]« r , 


(24.70) 


where L = r X (h/i)V, as in the one-particle coordinate representation. If the unitary 
operator U is expressed as 


U = exp[— {Hh)(f> n • $] 


(24.71) 
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follows from Eq. (24.70) that the Hermitian operator $ must satisfy the commo- 
tion relations 


0 , $\ = 


r x 7 v + f £ )w r * ^ 


(24.72) 


[rich are the rotational analogue of (24.35). The total angular momentum operator 



i|f(r) d 3 r 


(24.73) 


tisfies this equation. Similar to the total angular momentum for photons (Section 
>.2), the two terms on the right-hand side of (24.73) can be interpreted as orbital 
Ld spin angular momentum. 

Exercise 24.12. Verify that Eq. (24.73) is consistent with Eq. (24.72) and with 
e defining relation for helicity, (24.6). 

Since any proper orthochronous homogeneous Lorentz transformation may be 
>tained as a succession of spatial rotations and special Lorentz transformations, it 
iffices for the invariance proof to show the existence of S for special Lorentz 
informations. 


Exercise 24.13. For a special Lorentz transformation corresponding to uni- 
•rm motion with velocity v = c tanh x along the x axis, show that 



(24.74) 


ote that since 2 10 is antihermitian, the matrix S is not unitary in this case. [The 
titary operator U, which effects this transformation in accordance with Eq. (24.51), 
in again be constructed by starting from the infinitesimal transformation.] 


Exercise 24.14. 
eld. 


Discuss coordinate translations in the theory of the Dirac 


From (24.67) and (24.74) it is easy to deduce that the matrix S for all Lorentz 
ansformations has the property 


SVS = / 


(24.75) 


Exercise 24.15. Verify Eq. (24.75). For three-rotations, reconcile this result 
ith (24.57). 


Combining (24.58) and (24.75), we obtain 

SVVS = af t v y°y v 


(24.76) 


If the unitary operator U, induced by a Lorentz transformation, is applied to the 
irrent density (24.47), use of (24.51) and (24.76) shows that the current density is 
four-vector operator and satisfies the transformation equation 

U*r<X', t')U = a\f( r, t) 

i generalization of the concept of a vector operator defined in Section 17.7. 


(24.77) 





3 Relativistic Invariance 


605 


The study of proper orthochronous Lorentz transformations must be supple- 
mented by consideration of the fundamental improper transformations. Spatial re- 
flections will be discussed in the remainder of this section. The study of time reversal 
is left to Section 24.5. 

If spatial reflection of all three coordinates, or inversion, is a symmetry oper- 
ation for the Dirac theory, condition (24.58) implies that 

S _1 y°S = y°, and S _1 yS = — y (24.78) 

4 

From these equations and (24.51) for r' = — r, it follows that the current density 
(24.47) behaves as a four-vector under the action of the unitary inversion operator 
U P : 

Uln-v, t)U P = jflv, t) (24.79) 

only if S is unitary, S t S = 1. Except for an arbitrary phase factor, all the conditions 
imposed if S is to represent an inversion are solved by 

S = y° = (3 (24.80) 

and the inversion is thus accomplished by the relation 

t/piKr, t)U' P = y°i|i(-r, 0 (24.81) 

The unitary operator U P defined by this equation is the parity operator for the Dirac 
field. It is conventional to assume that the vacuum state is an eigenstate of U P with 
even parity. 

Exercise 24.16. Show that the current is a (polar) vector under coordinate 
inversion only if the matrix S is unitary. Attempt an explicit construction of the 
parity operator in terms of the field operators. 


It is convenient to define an additional Hermitian 4X4 Dirac matrix, 

y 5 = y 5 = iy°y 1 y 2 y 3 (24.82) 

which has the properties 

y^y 5 + y 5 y^ = 0 and (y 5 ) 2 = 1 (24.83) 

Exercise 24.17. Construct the matrix for y 5 in the standard representation. 

Derive the eigenvalues and eigenspinors of y 5 . Prove Eqs. (24.83) as well as the 
property 

[y 5 , 2H - 0 (24.84) 


Exercise 24.18. Verify the following transformation properties for the des- 
ignated bilinear functions of the field operators under proper orthochronous Lorentz 
transformations and under reflections: 


»Kr, *)*K r » 0 
Oy'Xr, 0 

»J*(r, O^AKr, t) 
*K r > 0y 5 y^(r, t) 
t|i(r, 0y 5 *ft(r, t) 


scalar 

vector 

antisymmetric tensor of rank two 
axial (pseudo-)vector 
pseudoscalar 


(24.85) 


The five kinds of bilinear field operators (24.85) provide the building blocks 
for constructing different interactions in relativistic quantum field theories. It can 
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; shown 6 that the set of matrices 1, y*, 2^ v , y 5 y tL , y 5 , which glue i|j and together 
these bilinear products, is complete in the sense that any arbitrary 4X4 matrix 
in be expanded in terms of these 16. They form the basis of an algebra of 4 X 4 
atrices. 

Solutions of the Free Field Dirac Equation. In this section, we derive explicit 
dutions of the Eqs. (24.27) and (24.28) which combine to: 

H p u(p) = (cot ■ p + fimc 2 )u( p) = Eu(p) (24.86) 

or this purpose, we will employ the standard representation (24.32) of the Dirac 
atrices. 

Exercise 24.19. Write out the four linear homogeneous equations implied by 
14.86) in full detail, and show that the vanishing of their determinant is assured by 
le condition E 2 — E 2 = c 2 p 2 + (me 2 ) 2 . Prove that all 3 X 3 minors of the scalar 
eterminant also vanish (but not all 2 X 2 minors), and interpret this result. 


The simplest solutions are obtained if the momentum vector p points in the 
irection of the positive z axis. In this case, Eq. (24.27) reduces for E = E p to 


r 


t me 2 

0 

cp 

0 

l«i\ 


( u A 

0 

me 2 

0 

-cp 

u 2 

= E n 

u 2 

cp 

0 

— me 2 

0 

w 3 

p 

u 3 

\° 

-cp 

0 

—mc 2 J 

\u 4 j 


l« 4 / 


mc 2 u 1 + cpu 3 = EpUx 
cpui — mc 2 u 3 = E p u 3 
mc z u 2 ~ cpu 4 = E p u 2 
— cpu 2 ~ mc 2 u 4 — E p u 4 


(24.87) 


Evidently, this system of equations possesses two linearly independent, and in fact 
rthogonal, solutions: 


1 ) 


1 0 

0 


1 

cp 

and u {L) oc 

0 

E„ + me 2 


-cp 

0 i 

\E P + me 2 } 


(24.88) 


'he labels R and L have been affixed to these spinors because they are eigenspinors 
f the one -particle helicity operator £ • p (here reduced to 2 Z ) with eigenvalues + 1 
nd —1, respectively. According to the formula (24.73), this is the component of 
ngular momentum along p for the particles of linear momentum p, since orbital 
ngular momentum contributes nothing to this projection (Exercise 24.12). 


6 See Rose (1961), Section 11. 
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The corresponding solutions for the eigenvalue E — —E p are 


/ ~ cp \ 

/ 0 \ 

E p + me 2 


cp 

0 

and v (L) cc 

E D + me 2 

1 


0 

0 1 


l > / 


(24.89) 


Exercise 24.20. Show that the only component of the matrix £ that commutes 
with H p is the helicity operator £ • p. 


Exercise 24.21. Determine the multiplicative constants for each of the four 
solutions (24.88) and (24.89), ensuring the normalization (24.17). 


Exercise 24.22. Show that the free-particle spinors for p in the z direction can 
also be generated by applying a Lorentz transformation (also known as a Lorentz 
boost) to the trivial solutions of (24.86) for a particle at rest (p = 0). Refer to 
Exercise 24.13 and Eq. (24.74). 


The eigenspinors with definite helicity but arbitrary linear momentum vector p 
are easily found by rotating the states described by (24.88) and (24.89) by an angle 
(f> = arccos (p z /p) about the axis determined by the vector {—p y , p x , 0). Such a 
rotation takes the z axis into the direction of p. The matrix operator that carries out 
this rotation is 


S = exp 


^ (~Py2 X + pfLy) 

‘2 vp, + ?, 


(24.90) 


Using the generalization of the identity (16.62) to the 4 X 4 Pauli matrices, we may 
write this as 


(f> ( Py 2 x ~ Px 2y) . (f, 

S = 1 cos — + 1 7 = sin — 

2 2 


(24.91) 


Hence, if the components of p are denoted as ( n x , n y , n z ), the rotated spinors are 

\ / 

<f> 


cos 


(Px + iPy ) . (f) 

Vp^7, " " 


sin 


cp 


E p + me 


cp 


<f> 

2 cos - 

Px + iPy 


OC 


n z + 1 
n r + in v 


cp 


sin — 


\E P + me 2 Vp^7 z 2 


E p + me 
CP 

E p + me 2 


(n z + 1) 


2 v-z 


(n x + in,,) 


M W (p) a: 


(24.92) 



Chapter 24 Relativistic Electron Theory 


(£), 


(P) 


-Px + iPy (f> 

sin 


Vp^7 z 2 

cos ^ 

2 


cp 


'Px + . <f> 


E p + me 2 Vp 2 - p 

cp 0 

^ cos — 

£ n + me 2 2 


, sm 2 


oc 


-n r + in v 


n 7 + 1 


cp 


E p + mc‘ 


cp 


E p + me 2 v z 


(— n* + in y ) 


(n z + 1) 


Exercise 24.23. Work out similar expressions for v {R) and v (L) . 
Exercise 24.24. Verify the closure relation (24.33). 


(24.93) 


The matrix 


B + (p) = w w (p)w w+ (p) + u a) (p)M (/ ' )t (p) (24.94) 


instructed from the normalized eigenspinors, gives zero when applied to an eigen- 
anor of cot • p + /3rac 2 with eigenvalue — E p \ applied to an eigenspinor of 
k • p + (3mc 2 with eigenvalue E p , it acts like the unit matrix. Hence, it can be 
^pressed as the ( Casimir ) projection operator 


B+(p) 


E p + COL • p + /8 me 2 
2 ~E P 


(24.95) 


milarly, the matrix 

E d — ca ■ p — Bmc 2 

B-(p) = v (R \p)v™(p) + ^( P y^(p) - — (24.96) 

2E p 

:ts as a projection operator for the eigenspinors with eigenvalue ~E P . These pro- 
ction operators are useful in many applications. 


Exercise 24.25. Show that for the eigenspinors of fixed momentum p and an 
bitrary 4X4 matrix A, 

M wt Aw w + u iL) 'ku {L) + v (R)t Av <R> + v (L) 'Av (L) = trace A (24.97) 


Charge Conjugation, Time Reversal, and the PCT Theorem. A simple rela- 
onship follows from the similarity of the equations satisfied by v ( * ,z,) (p) and 
^(p). If the complex conjugate of Eq. (24.28) is taken and p is replaced by — p, 
e obtain 

(-ca* • p + me 2 f3*)v iR,L) *(— p) = — E p v (R,L) *(— p) 
hich is to be compared with (24.27), 

(ca • p + me 2 /3)u (R,L \p) = E p u {R,L) ( p) 

' we can find a matrix C with the properties 

Ca* = aC, C/8* = — /8C 

is seen that Cv (R,L ^*(— p) satisfies the same equation as n (R,L) (p). 


(24.98) 
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Exercise 24.26. Establish that C *yC = y as well as 2C = — C2* and 

Q-lyOQ _ _^0 

Helicity is preserved under this transformation of solutions of the Dirac equa- 
tion. Indeed, from the equations 

2 • pO w *(-p) = — C (2 • p)*jt/ (R) *(— p) = (V ff) *(-p) 

2 • pC^ (L) *(— p) = — C(2 • p)*j/ L) *(— p) = — Cv iV> *(— p) 

if follows that the identification 

w w (p) = Cv ( *>*(-p) (24.99) 

and 

i w a) (p) = Cr ,(i) *(— p) (24.100) 

may be made. By using the same matrix C in both of these equations, we make a 
partial choice of the previously undetermined relative phases of the spinors u and 
v. The normalization (24.17) requires that 

M (*,z.)t(p) M (/?x)(p) _ p)C t Ci' ( *' i >*(— p) = v ( * X)+ (— pjCC*^’^— p) = 1 

or 

CC* = 1 (24.101) 

i.e., C must be unitary. In the standard representation (24.32), the conditions (24.98) 
and (24.101) are satisfied by the matrix 

C = $OL y = y 2 

Equations (24.14) and (24.15) show that M (/f,£,) (p) is associated with the anni- 
hilation of an electron and v iR,L) *(—p) with the annihilation of a positron. The con- 
nection (24.99) and (24.100) between these two amplitudes suggests that the unitary 
transformation that takes electrons into positrons and vice versa, without changing 
either momentum of helicity, may have a simple local formulation in terms of 
the fields. We define the unitary operator C, known as the charge conjugation or 
particle-antiparticle conjugation operator, by the equations 

Ca R (p)C' = b R { p) Cb R (p)C f = a R ( p) (24.102) 

Ca L ( p)C* = b L ( p) Cb L (p)C f = a L ( p) 

From Eqs. (24.14), (24.15), (24.99), and (24.100), it is seen that 

OK r)C = Cy°J(r) = Ciji t (r) (24.103) 

Exercise 24.27. Verify Eq. (24.103) and show conversely that 

Ci^rjC 4 = ^(r)C' 1 (24.104) 

The definition of C is supplemented by requiring that the vacuum state remain un- 
changed under charge conjugation: CjO) = |0). 

As time develops, the relations (24.103) and (24.104) remain applicable if the 
electron-positron field is free. This follows from the definition of charge conjugation 
and can be verified by showing that if r|/(r, t) and t) are connected at all times 
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' (24.103), the two Dirac equations (24.42) and (24.44) with = 0 imply one 
other. 

Exercise 24.28. Prove the last statement. 

In the presence of an external electromagnetic field ( A ^ + 0), the Dirac equation 
4.42) is no longer invariant under charge conjugation as defined by relation 
4.103). Applying this operation to Eq. (24.42), we obtain 

~ fc + iKC ^ = 0 

, by using the commutation properties of the matrix C, we reduce this equation, 
e find 

(24 • 105, 

This equation is the same as Eq. (24.44) except for the important change of the 
gn in front of the vector potential. The presence of an external field thus destroys 
e invariance of the theory under charge conjugation. At the same time, it is ap- 
irent that the invariance is restored if the electromagnetic field is regarded as part 
? the dynamical system and is reversed (A^ — A M ) when charge conjugation is 

jplied. 

Exercise 24.29. Reproduce the steps leading to Eq. (24.105), using the results 
f Exercise 24.26 and the properties of the Dirac matrices. 

Exercise 24.30. Show that under charge conjugation the current density op- 
:ator, defined in Eq. (24.47), changes into its negative if the anticommutation prop- 
:ties of the field are used. 

We now return briefly to the parity operator U P , defined in Eq. (24.81) by its 
;tion on the field: 


£/piKr, t)Up = y°«K— r, 0 (24.106) 

/e note that together with n (R,L) ( p) the spinors y°n (RvL) (— p), obtained by reflection, 
re also solutions of Eq. (24.27). Since 2 • p changes sign under reflection, the he- 
city is reversed, and y°w w (— p) must be proportional to u iL \ p). Similarly, 
,( V* ) (— p) must be proportional to v (L) (p). It is consistent with the relations (24.99) 
nd (24.100), and the condition y°C = — Cy 0 (Exercise 24.26) to set 

y°M W (-p) = u <L) (v) (24.107) 

ince y° is Hermitian, the equations 

w^p) = y°u (R \~ p) = y°Ci> (/?) *(p) = — Cy°^ (R) *(p) = — C[y°^ w (p)]* 


nd (24.100) lead to the conclusion that we must have 

yV ff) (-p) = -v (i) (p) 


(24.108) 
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N From (24.106), (24.107), and (24.108), we deduce the transformation properties of 
the electron and positron annihilation operators under spatial reflection as 

U P a R (p)U f P = a L (- p) U P a L (p)Ul = a R (-p) (24.109) 

U P b R (p)U P = -b L (~ p) U P b L {p)Ul = -b R (- p) 

The difference in sign between the equations in the first and second rows has im- 
portant physical consequences, since it implies that an electron and a positron in the 
same orbital states have opposite parities. 7 

We conclude this discussion of discrete symmetries with some remarks about 
time reversal. The general concepts needed for the discussion were already presented 
in Section 17.9. The antiunitary time reversal operator © is defined to reverse the 
sign of all momenta and spins. We therefore require that 

0^(p)© _1 = e iaR(p) a R (-p) ©^(p)© -1 = e iadP) a L (~ p) (24.110) 

©b R (p)© -1 = e‘ Mp) b R (-p) ©b L (p)© -1 = e‘ PL(p) b L (- p) 

Although the phases in (24.110) are arbitrary, it is possible to choose them in such 
a manner that the fields undergo simple transformations under time reversal. From 
the antiunitary property of ©, one may derive the transformation properties of the 
creation operators: 

®4(p)© _1 = e _i “« (p) 4(-p) ©ai(p)© -1 - e'~ iaL<p) al(-p) (24.111) 

Q&Kp)©" 1 = e- iMp) bl(- p) ©fci(p)©- 1 - e- ipL(p) bt(-p) 

Exercise 24.31. Derive (24.111) from (24.110). 

If we apply 0 to the fields defined in Eqs. (24.36) and (24.37), and make some 
trivial substitutions in the integrand, we obtain from (24.110), 

©ili (+) (r, 0© -1 = (2 irfif n I 

+ u (L) H~ p)e ia *- p) a L (v)]e (i,hXp r+Ept) d 3 p 

In arriving at this equation, the antiunitary nature of 0 is used, resulting in complex 
conjugation. The right-hand side of this equation becomes a local expression, 
Ti/f (+) (r, — t), for the time-reversed field, if a 4 X 4 matrix T can be found such that 

— p ) e ~ p) = Tu (R ’ l >(p) (24.112) 

The normalization (24.33) implies that T must be unitary: 

TT t = 1 (24.113) 

The relations (24.112) are consistent with the Dirac equation (24.27) only if T sat- 
isfies the conditions 


a*T = —Tol, j3*T = T/3 (24.114) 

The unitary solution to these equations is unique except for an arbitrary phase factor. 


7 For illustrations of the selection rules that can be derived for interactions invariant under reflec- 
tion and charge conjugation, see Sakurai (1967), Section 4-4. 
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Exercise 24.32. If T t and T 2 are two different unitary matrices that satisfy 
1. 1 14), construct TjT^ 1 . Show that this commutes with all Dirac matrices and 
ace must be a multiple of the unit matrix. 

In the standard representation (24.32), the imaginary matrix 

T = -ia z ot x = 

a solution. It has the important property 

TT* = -1 

iich can be proved to be independent of the representation. 

Exercise 24.33. Apply the time reversal operator to the negative frequency 

rt of the field, and show that the same matrix T may be used to transform i|/ (_) as 

+) 


(24.115) 

(24.116) 


It follows that the complete electron-positron field is transformed under time 
/ersal according to 


©i]i(r, 0© _1 = Ti|»(r, -t), ©iJiV, t)© -1 = V(r, ~t)V (24.117) 


Exercise 24.34. Show that TL • p = — £* 
der time reversal, substantiating (24.112). 


pT and that helicity is preserved 


The properties of T impose restrictions on the phases a and /3 in (24.110) and 
4.111). Iteration of Eqs. (24.112), in conjunction with the requirement (24.116), 
ves the result 


e ‘ a K,rJ-p)- ia K,i.(P) = giPit.d-Pl-'PRpP) = _ | 


(24.118) 


le effect of two successive time reversals can now be established. Owing to the 
tilinearity of ©: 

© 2 a^(p)® -2 - e -‘“AP)© a/? (_p)@-i = c -taR<P)+to*(-P) fljj (p) = -a R (p) (24.119) 

ius, application of © 2 merely changes the sign of the annihilation operator. The 
me conclusion holds for all other annihilation and creation operators. Hence, © 2 
ts like +1 on states with an even number of Dirac particles (and, more generally, 
rmions), and like —1 on states with an odd number of such particles. This con- 
usion agrees with the discussion of Section 17.9 (Kramers degeneracy). 

Double time reversal cannot have any physical consequences, and © 2 is a uni- 
ry operator that commutes with all observables. State vectors that are obtained by 
iperposition of states with even and odd numbers of Dirac particles undergo an 
lacceptable transformation under the application of © 2 and cannot be physically 
alized. This statement is a superselection rule, and it is consistent with a super- 
lection rule inferred in Section 17.2 from the commutivity of observables with 
itations by 27r: Fermions cannot be created or destroyed in odd numbers. 

In the presence of an external electromagnetic field, the time reversal operation 
generally no longer a symmetry operation. However, the invariance of the Dirac 
juation (24.43) under time reversal as defined by (24.117) is restored if A is 
langed into -A, while <J> is left unchanged. 
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Exercise 24.35. Determine the transformation properties of the Dirac current 
density operator under time reversal. 

In addition to angular momentum operators and other generators of the proper 
Lorentz group, we have now discussed three discrete symmetry operations corre- 
sponding to reflection, charge conjugation, and time reversal. Originally defined for 
free fields, these operations may remain symmetry operations when interactions are 
introduced. For iristance, quantum electrodynamics is invariant under each of these 
three operations. Weak interactions are not invariant under the three operations sep- 
arately. However, invariance still holds for the product (i.e., the successive appli- 
cation) of the three discrete operations (PCT theorem ), provided only that the re- 
stricted principle of relativity is valid. 8 

6. The One-Particle Approximation. In Chapter 21, quantum field theory was 
developed into a consistent description of systems of identical particles from the 
concepts of nonrelativistic quantum mechanics for a single particle. But, as was 
emphasized in Chapter 23 for photons and in Section 24.1 for electrons, there are 
obstacles in the way to constructing a relativistic form of local one-particle quantum 
mechanics in a rigorous manner. Inevitably, such a theory is an approximation to a 
proper many-body theory. 

It is tempting to identify the state 

^l(r)|0) - | r, a) (24.120) 

in analogy to Eqs. (21.3) and (21.6) as the one-electron state that corresponds to a 
sharp position of the particle with discrete quantum number a. The inadequacy of 
this identification in the relativistic theory is seen from the fact that, owing to the 
anticommutation relations, the state (24.120) cannot be normalized properly. The 
trouble stems, of course, from the properties of the field i|/ whose expansion contains 
both electron annihilation and positron creation operators, so that 

*a(r)|0> # 0 

An obvious possibility for remedying this difficulty is to use the positive frequency 
field, instead of Eq. (24.120), and to make the identification 

«!4 +,t (r)|0> = | r, a) (24.121) 

for a one-electron state. Such a theory, if pursued, would contain one-electron wave 
functions that do not correspond to any state in the original field theory (e.g., the 
negative energy eigenstates of H). This difficulty would make its appearance when- 
ever we encountered an operator, such as a strong potential energy V, that connects 
the “physical” with the “unphysical” states. Arbitrary exclusion of the “unphys- 
ical” states would violate the completeness requirements and lead to incorrect re- 
sults in calculations that involve virtual intermediate states. On the other hand, their 
inclusion would be embarrassing, since the theory then permits transitions to “un- 
physical” states if a strongly fluctuating perturbation is applied. 9 

The conclusion is inescapable that in relativistic quantum mechanics there can 
be no one-particle state that describes a particle at position r. Although we can say 


8 For further discussion of the PCT theorem, see Gross (1993), Section 8.7. 

9 For an interesting discussion of these difficulties, see Sakurai (1967), pp. 120-121. 
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iw much charge there is in a small volume in space, we cannot specify precisely 
iw many particles are located in this volume. On the other hand, we are able to 
>unt the number of particles with a specified value of momentum p, as we did in 
;ction 24. 1 . (The asymmetry between the descriptions in coordinate and momentum 
ace in the relativistic theory can ultimately be traced to the qualitative difference 
fiween time and energy: Time runs from — °o to +°°, whereas energy is always 
tunded from below.) 

Instead of modifying the field operators, we choose to formulate the one-particle 
iproximation by introducing a new “ electron vacuum ” \0e) such that 

4»«(r) | Oe) = 0 (24.122) 

fids. Obviously, this is a state in which there are no electrons but in which all 
mailable positron one-particle states are occupied. Thus, it can hardly be called the 
lysical vacuum, since relative to the no-particle vacuum it has infinite positive 
large and infinite energy. Nevertheless, departures from this state by the addition 
: one electron or subtraction of one positron can be treated effectively as one- 
irticle states. We thus define a one-electron state as 


\^ e ) = J *!» + (r, r, t) d 3 r |0<?> 


(24.123) 


here ip e (r, t ) is a spinor wave function — not an operator — with four components, 
rom the equal-time anticommutation relation, 

04»js( r\ t) + i|ip(r\ 0»l»£(r, 0 = S afi S(r - r') 

id Eq. (24.122), it follows immediately that the normalization 


= J t/4(r, 0<Ae(r, 0 d3r - 1 

t 

; applicable and that the electron wave function is 

•Mr, t) = <0tf|t|»(r, f)l^ e ) (24.124) 

l analogy to Eq. (21.57) in the nonrelativistic theory. 


Exercise 24.36. Verify that the total charge of the system in state of Eq. 
24.123) differs from the charge in the new vacuum state | Oe) by —e. 

Similarly, a new “ positron vacuum ” |0p) and a one-positron state |M r / ,) are 
efined by the equations 

_ i|i t (r)|0p) = 0 (24.125) 

|%) = J ij»(r, f)C~V P (r, t) d 3 r j Op) (24.126) 

? rom the normalization of these states to unity, the positron wave function iff p (r, t ) 
s obtained as 


t/f P (r, t) = (Op|Ci}i + (r, t)\%) 


(24.127) 
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In spacetime, the spinor wave functions i f/ e and tft p satisfy the same equations 
(24.42) and (24.44) as the field operators: 


(24.128) 

(24.129) 


These are the relativistic generalizations of the one-particle time-dependent Schro- 
dinger equation for particles with spin one-half. Dirac originally proposed the equa- 
tion bearing his name in the one-particle form (24.128). 

To recover the usual probability interpretation of one-particle quantum me- 
chanics, we consider an additive Hermitian one-particle operator, like the linear or 
angular momentum, which in Sections 24.1 and 24.2 was written in the form 

% = J ift f (r)Kxlt(r) d 3 r 

Here £(r, ~ihV) is a function of the position and momentum vectors and, in addi- 
tion, may also be a 4 X 4 matrix acting on a spinor. Using (24.123) and its inverse 
(24.124), we obtain for such an operator the one-electron expectation value 

(%, %%) = j tpl(r)Ktp e (r) d 3 r (24.130) 

Exercise 24.37. Show that similarly for a positron, 

(%, 3C%) = j if/l(r)Ktf/ p (r) d 3 r (24.131) 

if % represents a physical quantity that is invariant under charge conjugation and 
that therefore satisfies the condition 

C''KC = -K* (24.132) 

Verify that linear and angular momentum operators satisfy this condition, as does 
the free-particle energy operator. 


+ + iK ^ = 0 


Owing to the connection (24.99) and (24.100), if forces are absent, the wave 
function of a free electron or positron with momentum p and definite helicity is the 
plane wave, 


XR,L) 


(2irh) 


3/2 


'(P) 


mXpr-E„t) = 


(IttIT) 


3/2 


Cv (R ' L) *(-p)e a/mpr ' E p t) (24. 133) 


The approximate nature of the one-electron or one-positron theory is apparent 
in many ways. For example, the equation of motion for a free one-electron wave 
function, 


7* s* + «*. 


= 0 


has, for a given momentum p, four linearly independent solutions. Two of these, for 
positive energy, correspond to the two spin states of the electron. The remaining 




16 


Chapter 24 Relativistic Electron Theory 


no are eigenstates of H with negative eigenvalues and represent, according to the 
efinition (24.123), the removal of a positron from (rather than the addition of an 
lectron to) the “vacuum” |0e). These solutions cannot be ignored since, for a spin 
ne-half particle with mass, Lorentz invariance requires that if/ e have four compo- 
ents, so that four linearly independent spinors are needed to specify an arbitrary 
litial state. Even if initially the wave function were a superposition of only electron 
igenstates, the amplitudes of the positron components may, under the influence of 
Drees, eventually become appreciable. 

As a simple example, we imagine that the free Dirac electron is subjected to a 
erturbation during a time interval r. According to (19.23) of time-dependent per- 
jrbation theory, the transition amplitude for an exponential perturbation is 

-T<*|V|*> f * e^'e-l'l '*dt r = ~T ~ T 2 2 <fc|V|j) 

ft J — co Ti 1 i (OfoT 

’he optimum value of r is evidently r = l/co ks . For this value, the transition am- 
litude has magnitude \(k\ V\s)\lh(i> ks . Since hi o ks ^ me 2 for transitions between 
ositive and negative eigenstates of H, it is apparent that the one-particle approxi- 
lation breaks down when interaction energies of strength ~ me 2 fluctuate in times 
f the order ~ hlmc 2 . 

Translating these considerations from time into space language, we can say that 
f the potential energy changes by me 2 over a distance of the order hlmc, an initial 
ne-electron state may lead to pair annihilation of the electron with one of the 
ositrons present in the state |0e). Properly, it may thus be said that we are dealing 
/ith a one-charge rather than a one-particle theory. 

Understanding and accepting the one-electron Dirac theory as an approximation 
d a more accurate description involving interacting fields, we now ignore the many- 
>ody aspects and consider the relativistic wave equation, 


ih 


di (/ 
dt 


ca • f V + - A j — e<$> + fime 2 




(24.134) 


m its own merits. From here on, instead of writing i jj e , we omit the subscript e and 
imply use to denote the Dirac electron wave function. We emphasize again that 
t is a spinor function rather than a field operator. 

Just as in Section 24.2, we can derive the continuity equation, 

+ V • {c^cap) = 0 (24.135) 

dt 


iefining a probability density 


P = 


md the probability current density 


j = cif/aiff 


(24.136) 

(24.137) 


iquation (24.134) has the usual form 
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familiar from ordinary quantum mechanics, with H being a Hermitian operator. The 
only unusual feature of H is the fact that, unlike the nonrelativistic one-particle 
Hamiltonian and unlike the total field energy operator <l€, the one-particle H, 


H = ca 


h _ e ' 

- V + - A 
> i c j 


e<j) + fimc 2 


(24.139) 


is not a positive definite operator. Nonetheless, we will see in Section 24.8 that a 
simple correspondence exists between the relativistic and nonrelativistic Hamilto- 
nians. 

The stationary state solutions of the Dirac equation for a free electron (or pos- 
itron) need not be discussed in detail here, because this was in effect already done 
in Section 24.4. In the nonrelativistic limit, E p ~ me 2 , in the standard representation 

(24.88) , the third and fourth components of are small in the ratio of vie com- 
pared to the first two components. The converse is true for the spinors v (R ’ L \ as 

(24.89) shows. It is, therefore, customary to speak of “large” and “small” com- 
ponents of the Dirac wave functions, but this terminology is dependent on the rep- 
resentation used for the Dirac matrices. The stationary state solutions of (24.134) 
for a static central potential will be taken up in Section 24.9. 


7. Dirac Theory in the Heisenberg Picture. In quantum field theory, the time 
development of the field operators i(i(r, t ) in (24.38) was formulated in terms of the 
Heisenberg picture. When the transition to a one-electron theory is made, Eq. 
(24.134) for the spinor wave functions i/f(r, t ) is expressed in the Schrodinger pic- 
ture. We now study the one-electron theory in the Heisenberg picture, where state 
vectors are constant while operators that are not explicitly time-dependent evolve 
according to the formula 

dA(t) 

ih —T 2 = Mt)H(t) - H(t)A(t) - [A(f), H(t)\ 
dt 

With the understanding that we are working in the Heisenberg picture, for the pur- 
poses of this section only we may simplify the notation by writing A for A(t). 

The Hamiltonian operator has the form 

H — cot ■ ^p + - — e4> + fime 2 (24.140) 

where now not only r and p but also the matrices a and (3 must be regarded as 
dynamical variables, subject to time evolution. 

If the time derivative of r is defined as the velocity operator v, we obtain 

V = - yr [r, H] - cot (24.141) 

dt in 

Although in classical mechanics they are equal, in the relativistic quantum theory 
the operator mv formed from (24.141) is not the same as the kinetic momentum, 
p + {etc) A. To derive the equation of motion for the latter, we note that 
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d 


dA 

dt 


f + i lKH] 


1- ca ■ VA 

dt 


mce, combining the last two equations. 


d I c \ c 3A 

— ( p + - A = eV(p 4 h ea • VA — eV(at ■ A) 

dt \ c ) c dt 

= " eOL X ^ X 


— p -1 — A = — eE — eot X B = — <?E — e - X B (24.142) 
dt\cj c 

lis equation can be written in a form that is even more reminiscent of classical 
lysics if we note the identity 


a(H + e(f>) + (H + e4>) a = 2c p + - A 

V c , 


(24.143) 


hich shows the connection between mv = mca and p + (e/c) A. 


10 


Exercise 24.38. Carry out the details of the calculation leading to (24.142), 
mluating all requisite commutators. Verify the anticommutation property (24.143). 


Combining (24.142) and (24.143) with (22.141), we obtain a quantum mechan- 
al analogue of the Lorentz equation, 


d_ 

dt 


1 

2 


/ H + ec)) 


+ 


H + e(f) 





(24.144) 


' expectation values are taken, H + e4> can be replaced in the lowest approximation 
y me 2 or —me 2 , depending on whether the state is made up of positive or negative 
aergy solutions of the Dirac equation. Effectively, therefore, 



+e 



(24.145) 


The upper sign corresponds to a single electron with charge —e and positive 
nergy moving in the given external field. The lower sign corresponds to the motion 
f a single hole in the “sea of positrons,” represented by the electron “vacuum” 
tate | Oe), in the presence of the external field. Such a hole is equivalent to a particle 
f the opposite charge, i.e., an electron, with negative mass or energy, and Eq. 
24.145) is consistent with this interpretation. 

More insight into the significance of various operators can be gained if the 
article is moving freely, so that we may choose A = 0 and <fi = 0 everywhere. The 


10 The Foldy-Wouthuysen transformation (see note 3) sheds light on the pecularities that are en- 
ountered in quantum mechanics when nonrelativistic concepts, such as position and velocity of a 
article, are generalized to the relativistic regime. 
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Heisenberg equations of motion may then be solved explicitly. The equation of 
motion (24.142) reduces to 



and p = const. 


The free particle Hamiltonian is 

a H = cot • p + jS me 2 

and the equation of motion for the operator a becomes 

dot 1 2 

Since H = const., this equation has a simple solution: 

— - a(0 = cH- 1 p + e 2(,/fi)w [a(0) - cH~ l p] 
c 


(24.146) 


(24.147) 


The last equation can be integrated: 


r«) = r(0) + ^ t + ~ (e 2 <“>“ - 1) 
H 2 iH 


a(0) - ^ 
V ' H 


(24.148) 


The first two terms on the right-hand side describe simply the uniform motion of a 
free particle. The last term is a feature of relativistic quantum mechanics and con- 
notes a high-frequency vibration ( li Zitterbewegung ”) of the particle with frequency 
** mc 2 /h and amplitude h/mc, the Compton wavelength of the particle. Since for a 
free particle, as a special case of (24.143), 

(a - cpH~ l )H + H( a - cptf -1 ) = 0 (24.149) 

in a representation based on momentum and energy, the operator ot — cpH -1 has 
nonvanishing matrix elements only between states of equal momentum and opposite 
energies. Thus, the last term in Eqs. (24.147) and (24.148) is intimately connected 
with the appearance of the negative energy states in a relativistic theory that simul- 
taneously describes particles and antiparticles. 11 

It is of interest to note a few further operator identities. For a free particle 
Hamiltonian, 

PH + H/3 = 2 me 2 (24.150) 

and 

y 5 H + Hy 5 = 2cL • p (24.151) 


Hence, in a state of energy E, the operator p has the expectation value 




(24.152) 


so that (p) approaches ± 1 in the nonrelativistic approximation and vanishes as the 
speed of light is approached. 


n Sakurai (1967), pp. 117-119 and 139-140. 
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Similarly, 

<y 5 > = <2 ■ p) | (24.153) 

lowing that the expectation value of y 5 is v/c times the expectation value of the 
jlicity operator 2 • p. The operator y 5 is called the chirality. 

Exercise 24.39. Verify Eqs. (24.150)-(24.153) for free relativistic electrons, 
rove that the helicity 2 • p is conserved in free particle dynamics, but that the 
lirality is conserved only if the Dirac particles have zero mass. (This result has led 
» the term approximate chiral symmetry for a theory of a Dirac particle whose mass 
neglected.) 


The role of the spin in the one-electron Dirac theory is brought into focus if 
r e evaluate the time derivative of 2 for an electron exposed to a vector potential A 
ut no potential cf), so that 


H — cot • 



+ f3mc 2 


:y a sequence of algebraic manipulations, we obtain 


d2 

dt 


1 2c 

= — • [2, a X 

in n 



(24.154) 


(24.155) 


nd 


dt dt 



a 



a X I p + - A 

V c , 


+ 


r i oi 

r / e \1 

1 


a • ( p + - A 


L \ c )\ 

L V c /J 

J 


’he contents of the brace on the right-hand side of the last equation may be reduced 
o (eh/c)Z X (V X A) = (ehlc ) 2 X B. Hence, the simple relation 


dZ dZ 

H — + — H = — 2cc2 X B 
dt dt 


(24.156) 


s valid. In the nonrelativistic approximation H = me 2 , this equation becomes the 

fl 

;quation of motion for the one-electron spin operator S = — 2, 


dS 

dt 


S X B 

me 


(24.157) 


V straightforward interpretation of this equation may be given: The time rate of 
change of intrinsic angular momentum (spin) equals the torque produced by the 
ipplied magnetic field. If a magnetic moment |A is associated with the spin, the 
orque is p, X B. Comparison with (24.157) shows that in this approximation 
he magnetic moment operator for an electron is 



me 


e 

2 me 


g s S 


(24.158) 
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with g s defined in Eq. (17.101). Except for small radiative corrections, the value 
g s = 2, derived here from relativistic quantum mechanics for a charged Dirac particle 
is in agreement with the experimental measurements. 12 


8. Dirac Theory in the Schrodinger Picture and the Nonrelativistic Limit. In 
this section we return to the Schrodinger picture for the one-particle Dirac equation: 




d ie A \ , 

+ ««=() 


(24.159) 


A convenient second-order differential equation for ip is obtained by iterating this 
equation as follows: 


y( 


- -r- I - iK 1 1 Yl - t— 


he ^ / 


.K (24i6o) 


whence 


7 M 7l 


ie 


hc^)\dx v hc A V ) + K \ 
or, separating the terms with p = v from those with p + v. 


ip = 0 


[-( 


ie 


1 


ie 


-IV + -A] + ? --^1 


VC 


he 


ie „ , 

— a • E + k 2 tp = 0 
he 


In more elegant, but also more recondite, relativistic notation: 


(— 


ie 

~ hc A \ 


d ie \ 
he 


+ 

2 he ^ 


K 2 Ip = 0 


(24.161) 


(24.162) 


where 


dA„ dA 




ixv 


dx^ 


is the electromagnetic field tensor. 

Exercise 24.40. Derive Eqs. (24.161) and (24.162). 


In the absence of an electromagnetic field, the second-order equation (24.162) 
reduces to the Klein-Gordon equation , 

fe^ + K 2 V -(- v2 + ?S +K2 V = 0 <24 - 163) 

which also governs the wave function for a relativistic particle with spin zero. In 
the presence of the external field, Eq. (24.162) differs from the relativistic Schro- 
dinger equation for a scalar particle by the terms containing the Dirac matrices and 
coupling the wave function directly to the electromagnetic field B and E. 

The second-order equation (24.160) has more solutions than the Dirac equation, 
from which it was obtained by iteration; it is therefore necessary to select among 
its solutions x on ly those that also satisfy the Dirac equation. A convenient method, 
which also has physical significance, is to classify the four-component solutions of 


12 Quantum electrodynamics gives the value g s = 2(1 + a/2if) to first order in the fine structure 

constant Sialrnrai (\ Q67'\ 
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le second-order equation according to their chirality. Since the chirality y 5 is Her- 
litian and anticommutes with all y M , the solutions of Eq. (24.160) can be assumed 
> be simultaneously eigenspinors of y 5 with eigenvalues +1 or —1, so that 

yV (±) = ±X (±) (24.164) 

/e call such solutions chiral. Since (24.160) can be written as 



follows that the chiral solutions are paired by the reciprocal relation 

<24 165) 

ach chiral pair generates a “self-paired” solution of the Dirac equation: 

t/, = * (+) + (24.166) 

Exercise 24.41. Prove that (1 ± y 5 )/2 is the chiral projection operator and 
lat any solution of the Dirac equation can be uniquely decomposed into a super- 
osition of two solutions of the second-order equation (24.160) with opposite chi- 
ility, but that in general the two chiral solutions are not paired as in (24.165). 


Exercise 24.42. Show that chirality is conserved and that the Dirac theory 
xhibits chiral symmetry if the particle mass is zero. (See also Exercise 24.39.) 


In order to interpret Eq. (24.161), we assume that the external field is time- 
ldependent and we consider a stationary state solution 


<A( r, 0 

ubstitution into Eq. (24.161) gives 


- —(iih)Et 


u(r) 


1 


'h 


eh 


~ (E + e<f>) 2 - ( - V + - A ) - — (£ • B — ia • E) - mV 


u — 0 


’his equation is still exact. For a nonrelativistic electron, for which E =» me 2 , we 
pproximate 

\(E + e<j>) 2 — m 2 c 2 «= 2 m(E + ecf) - me 2 ) 


fence, we obtain 

(E + ecf) — mc 2 )u — 


1 (h e 
- — ( — V H — A 
2 m \ i c , 


eh 

+ 2^ (S ' B 


iot • E) 


(24.167) 


yhich is very similar to the nonrelativistic Schrodinger equation. In the absence of 
n electric field, this equation describes the motion of the electron in an external 
nagnetic field and again shows that an intrinsic magnetic moment as given by 
24.158) must be ascribed to the electron. 

The physical appreciation of the Dirac theory is further aided by rewriting the 
onserved four-vector current density 

j M — —ecif/ y^ijj 


(24.168) 
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in terms of operators that have a nonrelativistic interpretation. To this end, we write 
the Dirac equations for if/ and if/ as 


*- L s{& (24 ' 169) 

*=4(i ? + £**)** (24 - i70) 


Substituting (24.169) in one-half of the current density and (24.170) in the other 
half, we get 


- _ 


A =tix 



d ie , 
dx x + hc A ^r 




where the terms with /jl 4= A have been separated from those with /jl = A, and the 
anticommutation relation (24.41) is used. With the definition (24.66) of 2 MV , is 
finally transformed into a sum of polarization and convection terms, 

= jpol + jconv (24.171) 

where 




(24.172) 


and 


F - 

Jconv 


eci . — ( d ie \ 


a ^ , Vt 


•/' 1 (24.173) 


This procedure is known as the Gordon decomposition of the current density. Note 
that the electromagnetic potential, which is absent from the expression (24.168) for 
the current, now appears explicitly in the convection current. 


Exercise 24.43. Prove that both the polarization and convection currents are 
separately conserved, and show the relation of the convection current to the non- 
relativistic current density (Exercise 4.17). Evaluate the polarization and convection 
currents for the free particle plane wave states. 

Exercise 24.44. Show that, as in (3.7), a conserved transition current density 
can be defined to be 

j\i = -ecipty^ifo. (24.174) 

where both i//j and i// 2 are solutions of the same Dirac equation. 


9. Central Forces and the Hydrogen Atom. In an electrostatic central field, the 
one-particle Dirac Hamiltonian for an election is 

h , 

H = cot • — V + /3mc 2 — ecf> (r) 


(24.175) 
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nee L and S do not separately commute with the free-particle Hamiltonian, they 
rtainly will not commute with the Hamiltonian (24.175) either. However, the com- 
ments of the total angular momentum, 


h „ h „ 

J=rX T V+-S 

i 2 


(24.176) 


'idently do commute with H, and we may therefore seek to find simultaneous 
genspinors of H, J 2 , and J z . Parity will join these as a useful constant of the motion. 

At this point, it is convenient (though not unavoidable) to introduce the standard 
presentation (23.32) of the Dirac matrices and to write all equations in two-com- 
ment form. We introduce two two-component spinors cp x and <p 2 by the definition 


* = 



(24.177) 


lie Dirac equation decomposes according to Exercise 24.11 into the coupled equa- 
3 ns 


ft 

(j E — me 2 + e(f>)(p 1 ccr • V <p 2 = 0 

i 

ft 

( E + me 2 + e(j))cp 2 cor • V cpi = 0 

i 


(24.178) 

(24.179) 


here all cr are 2 X 2 Pauli matrices. The operators J z and 

J 2 = L 2 + fiL • 2 + 7 h 2 

4 


^compose similarly, and it is clear that we must seek to make both <p t and <p 2 
vo-component eigenspinors satisfying the conditions 




<Pi ,2 = tnh(p U2 


(24.180) 


ad 

^L 2 + KL • o- + ^ fi 2 ^j<p h2 = j(j + l)h 2 <p h2 (24.181) 

quation (17.64) contains the answer to this problem and shows that for a given 
alue of j the spinors <p x and <p 2 must be proportional to The two-component 

pinors ^ are normalized as J = 1 , and they have the useful property 

<T • r 1/2 = - qyjs 1/2 (24.182) 


Exercise 24.45. Prove Eq. (24.182) by using the fact that or • r is a pseudo- 
calar under rotation and that < 2 / has a simple value for 0 = 0 . 


Exercise 24.46. Prove that 

cr • L qyf 1/2 - (j - ^ mf - V2 

cr • L mf +in = (-J - 


(24.183) 

(24.184) 
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Since the parity operator also commutes with the other available constants of 
the motion (energy and angular momentum), it may be chosen as further “good 
quantum number’ ’ and we may require that 

= + ( <Pi( r )\ ( even ) 

\<P 2 (~r)/ ~ \<p 2 ( r)/ (odd) 


The parity of the eigenfunction clearly dictates how the spinors m are associated 
with cpi and <p 2 . It is seen that the two solutions must have the form 


/ F{r)Wj! l/2 \ 
\-ifm j ji V2 ) 


(24.185) 


or 


<A = 


/ cmf +V2 \ 

\-ig{rWjl m ) 


(24.186) 


Equation (24.185) has even or odd parity depending on the parity of j — 1/2. Equa- 
tion (24.186) has even or odd parity depending on the parity of j + 1/2. The factor 
— i have been introduced so that the radial equations will be real. 

In order to derive the radial equations, we employ the following identity 

1 ( ia • l\ 

or • p = — (<r -rXc • r)(a • p) - tr • rl r • p H j (24.187) 


which follows from (16.59). Here, 


P = 


h d_ 
i dr 


If we substitute the last two relations into Eqs. (24.178) and (24.179), and take 
Eqs. (24.182), (24.183), and (24.184) into account, we obtain the coupled radial 
equations, 


( E — me 2 + e(f))F — fici 1- )/ = 0 

\dr r ) 

( E + me 2 + e<f)) f + he (— )F = 0 

\dr r 1 


(24.188) 

(24.189) 


and 


(. E — me 2 + e<f>)G — he 
( E + me 2 + e<p)g + hei 


' d_ 
K dr 
d 

— + 
dr 



j + 3/2 ' 


G = 0 


(24.190) 

(24.191) 


So far, it has not been necessary to introduce the explicit form of the potential, but 
at this point we assume that the electron moves in the Coulomb field of a point 
nucleus of charge Ze : 
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Ve. also define the dimensionless quantities, 


A - i + 2 ’ 


£ 


mcr 


= e, 


x, 


mr ft 

he coupled radial equations then become 

d A + 1 


e — 


£ + 1 + 


Za 
x , 
Za 


! + -|F- - + 


)' + 5- 


X 

A - 1 


nd 


e - 1 + 


Za 


G- 


' d_ 
,dx 


Za\ 


8 + 1 + — g + — + 


A - 1 

x y 

A + 1 


x / 


dx 


he 

)/ = 0 
= o 

)« = 0 
|g = o 


= a 


(24.193) 

(24.194) 

(24.195) 

(24.196) 

(24.197) 


ince these two sets of equations are obtained from one another by the transfor- 
lation 


F — > G, f — > g, A — » — A 

; suffices to consider Eqs. (24.194) and (24.195). 

The analysis of the radial equations proceeds as usual. Asymptotically, for 
— > oo we find the behavior 


” F = f = e ±Vl “ s * 

or bound states, to which we confine our attention, we must require | e | < 1 and 
hoose the minus sign in the exponent. With the ansatz 

F = e~ VTZ ^ x x y 2 a v x v (24.198) 

*’=° 

f = e - y/T=px x* 2 bp x P (24.199) 

p=0 

it obtain by substitution, for v > 0, 

(e — l)*?,,-! + Zaa v + Vl — £ 2 b„_i ~ (A + 1 + y + v)b v = 0 (24.200) 

nd 

(e + \)b v -i + Zab v — Vl —~e^a v -i + (—A + 1 + y 4- v)a v — 0 (24.201) 

s well as 

Zaa 0 — (A + 1 + y)b 0 = 0, (— A + 1 + y)a 0 + Zab 0 — 0 (24.202) 

Equations (24.202) are compatible only if 

(Za) 2 — A 2 — (y + l) 2 

lence, 

y = -1 ±VA 2 - (Za) 2 = -1 ±V(/ + 1/2) 2 - (Za) 2 

he root with the minus sign must be excluded because the corresponding wave 
motion would be too singular at the origin to be admissible. Hence, 

y = - 1 + Vp- + 1/2) 2 ~ (Zaf 


(24.203) 
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Provided that Z < 1/a ~ 137, this will yield a real value for y. For j — 1/2 we have 
— 1 < y < 0 and the wave function is mildly singular at the origin, but not enough 
to disturb its quadratic integrability. (See Section 12.4.) 

The usual argument can now be made to show that both power series (24.198) 
and (24.199) must terminate at x" (see Chapter 12). 


Exercise 24.47. Carry out the study of the asymptotic behavior of (24.198) 
and (24.199), and show that the power series must terminate. 


From the recursion relations (24.200) and (24.201), we then obtain for 
v = n' + 1 (with a n '+i = b n - +1 = 0), 


b„- = 


I. Lu. 

1 + £ 


a„ 


(24.204) 


From Eqs. (24.200) and (24.201), we may simultaneously eliminate a v - x and h„_! 
to get 


a v [ZaV 1 + e + (A — 1 — y — p)Vl — s] 

= Z>„[Za V 1 - e + (A + 1 + y + v)Vl + s ] 


(24.205) 


Letting v = n' and comparing with Eqs. (24.204), we finally conclude that 


1 + £ 


ZaVl + £ + (A — 1 — y — n')Vl — £ 
ZaVl - e + (A + 1 + y + n')Vl + e 


Vl — £ 2 (1 + y + n') = Zas 
This condition translates into the formula for the energy: 


E 

1 

me 2 

V 

/, , < z “> 2 

[Vo* + 1/2) 2 - (Za) 2 + n'] 2 


(24.206) 


This is the famous fine structure formula for the hydrogen atom. The quantum num- 
bers j and n' assume the values 


j 


13 5 
2 ’ 2 ’ 2 ’ ’ ' * ’ 


n 


0 , 1 , 2 ,.. 


The principal quantum number n of the nonrelativistic theory of the hydrogen atom 
is related to n ’ and j by 

n = j + 1/2 + n' (24.207) 

From (24.204) we have for n ' = 0 

Vl — £ a 0 — Vl + e b 0 — 0 

This relatio n between a 0 and b 0 is consistent with (24.202) only if A + 1 + y > 0, 
or A > -VA 2 - (Za) 2 , hence A > 0. The transformation A — » — A, which takes us 
from the states with j = € + 1/2 to those with j = i — 1/2, is therefore not per- 
missible if n' = 0, and a solution of type (24.186) is not possible if n = j + 1/2. 
Hence, for a eiven Value of the nrincinal nnantum number n there is nnlv nne state 
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ith j = n — Ml, while there are two states of opposite parities for all j < n - Ml. 
ince A = j + 1/2 appears squared in the energy formula, pairs of states with the 
ime j but opposite parities (e.g., lS 1/2 and 2P 1/2 ) remain degenerate in the one- 
lectron Dirac theory. Experiments have substantiated this formula and its radiative 
orrections (the Lamb shift that removes the 1S V2 — lP\n degeneracy, displacing 
le S state slightly above P ) to very high accuracy, impressively vindicating the one- 
lectron approximation of the full relativistic electron-positron theory. Figure 24.1 
lows the fine structure of the energy levels with n = 1 and 3 of the hydrogen atom. 

Exercise 24.48. Expand the relativistic energy of a hydrogenic atom in powers 
f ( Za ) 2 to obtain the Bohr-B aimer formula and the first correction to the nonrela- 
vistic energies. For hydrogen (Z = 1), compute the energies of the ground state, 
le 25 1/2 metastable state, and the IP doublet (see Figure 24.1). 


3 ^ 1/2 


3 P 3/2 

0.1082 crrr 1 

3 P\!2 


3D 5 /2 

0.0361 cm" 1 

3D 3/2 


1.89 eV ~ 15,000 cm' 


,-i 


2 ^ 3/2 


0.3652 cm' 1 


— 1 2S 1/2 — 1 2 P VZ 

igure 24.1. Detail of an energy-level diagram for the hydrogen atom. The manifolds of 
le n = 2 and 3 levels are shown, based on the Dirac theory, without radiative corrections 
!,amb shift) or hyperfine splittings. The energy differences are given in units of cm -1 for 
te reciprocal wavelength. The Lamb shift displaces S levels upward in energy by about 10 
ercent of the fine structure splitting for the manifold. 
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Problems 

1. If A and B are proportional to the unit 4X4 matrix, derive expansion formulas for 
the matrix products (a • A)(a • B) and (a • A)(£ - B) in terms of a and £ matrices in 
analogy with formula (16.59). 

2. If a field theory of massless spin one-half particles (neutrinos) is developed, so that 
the /3 matrix is absent, show that the conditions (24.30) and (24.31) are solved by 
2X2 Pauli iqatrices, a = ±o\ Work out the details of the resulting two-component 
theory with particular attention to the helicity properties. Is this theory invariant under 
spatial reflection? 

3. Develop the outlines of relativistic quantum field theory for neutral spinless bosons 
with mass. What modifications are indicated when the particles are charged? 

4. Show that the vector operator 

Q = j8£ + (1 - j3)£ • pp 

satisfies the same commutation relations as £ and that it commutes with the free Dirac 
particle Hamiltonian. Show that the eigenvalues of any component of Q are ±1. 

Apply the unitary transformation 

exp [i(0l 2) {-p y Q x + p x Q y )lV'p x + p$] 

to the spinors (24.92) and (24.93), and prove that the resulting spinors are eigenstates 
of H with sharp momentum and definite value of Q z . Show that these states are the 
relativistic analogues of the nonrelativistic momentum eigenstates with “spin up’’ 
and “spin down.” 

5. Assume that the potential energy — e$(r) in the Dirac Hamiltonian (24.175) is a 
square well of depth V 0 and radius a. Determine the continuity condition for the Dirac 
wave function if/ at r = a, and derive a transcendental equation for the minimum 
value of V 0 which just binds a particle of mass m for a given value of a. 

6. Solve the relativistic Schrodinger equation for a spinless particle of mass m and 
charge — e in the presence of the Coulomb field of a point nucleus with charge Ze. 
Compare the fine structure of the energy levels with the corresponding results for the 
Dirac electron. 

7. Consider a neutral spin one-half Dirac particle with mass and with an intrinsic mag- 
netic moment, and assume the Hamiltonian 

fl 

H = ca • — V + /3 me 2 + A5/3£ z 

i 

in the presence of a uniform constant magnetic field along the z axis. Determine the 
important constants of the motion, and derive the energy eigenvalues. Show that 
orbital and spin motions are coupled in the relativistic theory but decoupled in a 
nonrelativistic limit. The coefficient A is a constant, proportional to the gyromagnetic 
ratio. 

8. If a Dirac electron is moving in a uniform constant magnetic field pointing along the 
z axis, determine the energy eigenvalues and eigenspinors. 
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The Appendix is a compilation of mathematical accessories, definitions, 
conventions, and mnemonics that are applicable in quantum mechanics. 
Instead of mathematical rigor, ease of use is the objective. 


Fourier Analysis and Delta Functions. We first consider the generally 
>mplex-valued, periodic functions defined on the real x axis, 

fix + L) = f(x ) (A.l) 

hich can be expanded in terms of the Fourier series 

1 4* oo h +00 

fix) = -= 2 ^“- = - 7 = 2 c n e 2 ™ ML An (A.2) 

VLn=-oo VLn=-co 

tost functions of interest in quantum mechanics are or can be approximated by 
inctions that are in this category. On the right-hand side, An = 1 is redundant, but 
/ inserting it we are preparing for the transition from Fourier series to Fourier 
tegrals. The Fourier coefficients are calculated from 

1 r L ' 2 

c n = ^ u)e 2 ™ UlL du ( A -3) 

VL J-u 2 

here the integration interval —LI 2 < x s LI 2 has been chosen for convenience. 
!how that any other interval of length L would give the same coefficients.) Sub- 
itution of (A. 3) into (A.2) gives the identity 

1 +°° rL/ 2 rL/ 2 I +«> 

fix) = 7 2 e (27rnm(x ~ u> f(u) du = du f(u) - 2 e (2 ™ ILKx - u) (A.4) 

^ n= — 00 J L/l J L/ 2 Lj n ~ _co 

the exchange of integral and summation is permissible. 

By taking the limit L — > °° and turning the Fourier series into an integral by 
Le transformation 
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Since for a fixed x we can change the value of f in the integrand almost everywhere 
(except near u = x) without affecting the value f(x), (A. 7) represents the delta 
function 


S(x 


1 f + 


ik(x—u ) 


dk 


with the property of being an even function of its argument, 

8(x — u) = 8(u — x ) 

and 


f(x) 


-r. 


8(u — x)f(u) du 


(A. 8) 


(A.9) 


(A. 10) 


If condition (A. 10) is applied to a simple function defined such that /( x) = 1 for 
Xy < x < x 2 and f(x) = 0 outside the interval (jci, x 2 ), we see that the delta function 
must satisfy the test 


f 


8(u — x) du — 


\ 0 if x lies outside (x lt x 2 ) 
[l if Xi < X < x 2 


(A. 11) 


This equation may be regarded as the definition of the delta function. It is effectively 
zero whenever its argument differs from zero, but it is singular when its argument 
vanishes, and its total area is unity. 

The infinite integral in (A. 8) does not exist by any of the conventional defini- 
tions of an integral. Yet it is convenient to admit this equality as meaningful with 
the proviso that the entities which are being equated must eventually be used in 
conjunction with a well-behaved function f(x) under an integral sign. Physically, 
(A. 8) may be interpreted as the superposition with equal amplitudes (and equal 
phases at x = u) of simple harmonic oscillations of all frequencies. The contributions 
to the Fourier integral completely cancel by destructive interference unless the ar- 
gument of the delta function vanishes, i.e., x — u. 

By choosing to make the delta function (A. 8) the limit of a well-defined integral, 
we obtain (with u = 0) various useful representations of S(x): 


1 C +K 1 f + 

5(x) = — - lim e lkx dk — — lim 

Z7 T K—> os J K 2.TT £— >0"*" J 0 

- f lim f 

Z7T £_ >0+ J- 


e ikx ~ eV dk 


(A. 12) 


e ibc-s\k\ dk 


and many more can be invented. (Try some.) Small (“infinitesimal”) quantities like 
s are assumed to be positive everywhere in this book, without special notice. The 
representations (A. 12) are, explicitly 


aw = I i ta 

7T jV— > co X 


1 r 1 x \ 1 r 

— 7 = lim - exp I r = — lim 

V IT e-»0 + e V S / 77 


2 i 2 

0 + X + S 


(A. 13) 


Two further useful representations are: 

1 f + 

8(x) = ~ lim 

ITT e— »0 + J- 


+ * 


ks 


lim 

e-*0+ 


l/2e if — e < x < s 
0 if Ul > e 


(A. 14) 
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2 dx 2 


S ( x ) = ~ Tj I* 


(A. 15) 


i quantum mechanics, we also need the incomplete Fourier integral, 



e Kco-is)t dt 


e iio>+e)T 

ico + e 


(A. 16) 


: the limits T — » °° and s — > 0 are taken, but such that sT 0, i.e., the e-limit 
recedes the T-limit, (A. 16) is just 27r times (A. 8 ), or 

(io>+s)T 

lim 7 — 27r5((u) (A. 17) 

it o + e 

T — »°o 

(er-> 0 ) 


? , on the other hand, we choose F = 0 in (A. 16), we obtain 



e i(a>-is)t df 



e ~{i<o+e)t dt _ 


1 

ia> + e 


— ia> + e _ e . o> 

w 2 + s 2 u > 2 + e 2 * ft ) 2 + e 2 


(A.18) 


i the limit e — > 0, (A. 13) shows the first term on the right-hand side to be 7 r< 5 (ft>), 
nd the second term becomes —//ft) except if ft) = 0. If /(ft)) is a well-behaved 
mction, we have 


lim 

e->0 + 



ft) 


ft) 2 + 8 2 


do) 


lim 

E— > 0 + 



lim 

e— »0+ 


+ 


lim 

8->0+ 



i od(o 
ft ) 2 + s 2 



do) 

ft) 


'here P denotes the Cauchy Principal value of the integral. The integral from — s 
) e vanishes because, since f((o ) is smooth, the integrand is an odd function of ft), 
fence, (A.18) may, with the usual convention about its use in integration, be written 
s 


lim — - — = 7 t8((o) — iP — (A. 19) 

e _* 0 + l(i) + s ft) 

;oth (A. 17) and (A. 19) are useful in formal scattering theory (Chapter 20). From 
\.19) we deduce that 

(A.20) 


lim 

e— >0+ 


1 ' 
2 _ 


1 


cu + is 


+ 


ft) 


is 


= P- 

ft) 


nd 


lim 


1 

<y + is 


— 2m8((o) 


(A.21) 


7e also require the value of the Fourier transform of 1 /(ft) — is) for finite values of 
> 0. We can obtain this by integrating (A. 16) over all co : 



i<oT 

do) 

id) + s 
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from which we conclude that, for positive e. 



io>T 

dot 

to — is 


(2m e~ eT if T> 0 

[0 if r<o 


(A.22) 


This relation is important for exponential decay processes. 

We can also use this last equation to construct the Fourier representation of the 
Heaviside step function, which is defined as 

f x [ 1 x > 0 

rj(x) = 8(u) du = \ (A.23) 

|^0 x < 0 

From (A.22) we see that 

J C + co e i(OX 

- — ; I — do) = e~ sx r](x) (A.24) 

2m J-oo co — is 


If we take the limit s 


0 and substitute (A. 19) in the integrand, we obtain 


, x 1 1 _ 



i<ox 

do) 

(0 


(A.25) 


We conclude this section by presenting several useful identities involving the 
delta function. The proofs are easy. First, we have the identity: 


S(ax) — t — 7 S(x) (A.26) 

\ a \ 

for any nonzero real constant a. Next, we see that for a well-behaved function /(x), 

f(x)S(x — a) = f(a)S(x — a) (A.27) 

A simple inference is that 


xS(x) = 0 


(A.28) 


Equation (A.26) can be generalized to give a formula for the delta function of a 
function g(x). The delta function S(g(x)) vanishes except near the zeros of g(x). If 
g(x) is analytic near its zeros, x,, the approximation g(x) ~ g'(x,)(x — x,) may be 
used for x « x t . From the definition (A. 10) and from (A. 26) we infer the equivalence 

S(g(*)) = X s (* ~ xt ) (A.29) 

provided that g'ixd # 0. A special case of this identity is 

8((x — a){x — b )) = t -7 [S(x — a) + S(x — Z?)] (A.30) 

| a — b | 

For example, 

S(x 2 - a 2 ) = tt r [5(x - a) + 8(x + a)] (a + 0) (A.31) 

2\a\ 

and 


S(Vx - Va) = 2 Va 8(x - a) (a > 0) (A.32) 



4 


Appendix 


The theory of Fourier integrals and the delta function in three dimensions is 
-aightforward, if we generalize (A. 8) by defining 

«(r) = —-3 \ (A.33) 

Review of Probability Concepts. In this brief review of probability concepts, 
e assume familiarity with the basic rules of adding and multiplying probabilities. 1 
le results of a real or imagined preparation, experiment, test, or trial on a system 
e unambiguously identified as outcomes O l , 0 2 , ■ . . We assume that each possible 
itcome, O h is assigned a probability of occurrence p t with 

S Pi = 1 

i 

ie set of probabilities p, is called a (probability) distribution. In quantum physics, 
i experiment leading to a set of possible outcomes with probability p,- is generically 
ferred to as a measurement, even though the outcome may not necessarily measure 
e value of a particular quantity. 

In rolling an ordinary die, the six possible results, the number of dots on the 
p, may be identified as outcomes O x through 0 6 , their probabilities being p t = 
= ... — p 6 — 1/6, if the die is perfect, unbiased, and not loaded. If, on the other 
ind, the “even” (2, 4, 6) or “odd” (1, 3, 5) character of the die roll is chosen for 
io distinguishable outcomes, 0 1 and 0 2 , the probabilities are 1/2 each. 

A variable X which takes on the value of X x if outcome O x occurs, X 2 if 0 2 
;curs, etc., is called a random variable. A function of a random variable is itself 
so a random variable. If in rolling the die you are promised 2 pennies for each dot 
l the die, your winnings constitute a random variable with values X ( — 2 i. 

The expectation value (or expected or mean value), (X) [or E(X) in the notation 
vored by mathematicians], of the random variable X for the given probability 
stribution is defined to be the weighted sum 


(X) = 2 x iPi 

i 


(A.34) 


i the example, (X) — (2 + 4 + 6 + 8 + 10 + 12) X 1/6 = 7 pennies. Seven 
mnies is the expected gain per die roll, whence the term expectation value for (X). 
/he game will be a fair one, if the ante is 7 pennies.) 

The variance (AX) 2 of the random variable X is defined by 


(AX) 2 = <(X - (X)) 2 ) = X (X, - (X)) 2 p ( . = <X 2 > - (X) 2 

i 


(A.35) 


is a measures of the deviation from the mean. In the example of the die, the random 
iriable X with values X, = 2 i (i = 1, ... 6) gives the variance 


(AX) 2 - 4 X 


l 2 + 2 2 + 3 2 + 4 2 + 5 2 + 6 2 


?2 = Z2 


id the root-mean-square or standard deviation is AX = 3.42. 


'For more on probability, see Chapter 6 in Bradbury (1984). 






2 Review of Probability Concepts 


635 


In the rigorous formulation of the Heisenberg uncertainty relations we also 
encounter the covariance of two random variables X and Y: 


((X - (X))(Y - <y»> - (XY) - (X)(Y) 


(A.36) 


If X and Y are independent (uncorrelated), the average of the product XY equals the 
product of the averages (X)(Y), and the covariance vanishes. 


Exercise A.l. In the example of the perfect die, let X = 0 if the number of 
dots is less than its mean value and X — 1 if it exceeds its mean value, and Y — 0 
or 1 depending on whether the number of dots is even or odd, respectively. Compute 
the covariance. 


We can relate a given probability distribution to the statistical properties of a 
string of A similar trials which lead to outcomes, O x , 0 2 , ■ ■ . O n with probabilities 
Pi, p 2 , . ■ ■ p„- The total number of distinguishable strings of N outcomes is n N . The 
number of these strings in which outcome O x occurs N x times, outcome 0 2 occurs 

n 

N 2 times, and so forth, with 2 Af,- = A, regardless of their order of occurrence in 

1 

the string, is the multinomial coefficient 


(A; N x , N 2 ,... N„) = 


A! 


N x \N 2 \ . . . NJ 


(A.37) 


The probability that in the A trials outcome O x occurs N x times, outcome 0 2 occurs 
N 2 times, and so forth, is given by the multinomial distribution, 


W; n x , n 2 ,... a„) = 


m 


P N 1'P2 2 ■■■Pn" 


(A. 38) 


N x \N 2 \ ...N n \ 

The mean value (A,) of the number of successes in producing the outcome O t is 

<A0 ^ N Pi (A.39) 


Exercise A.2. Show, by use of the multinomial expansion formula, that for 
given values of N and n, the sum of terms (A.37) is equal to n N and that the prob- 
abilities (A. 38) add up to unity. Prove (A.39) and show that the variance (A N t ) 2 is 

(AA,.) 2 = (A 2 ) - (N t ) 2 = Api(l - pd (A.40) 

Formulas (A.39) and (A.40) show that in the limit of large A, the multinomial 
probability distribution is sharply peaked at A, = Np t (i = 1,2 ,...,«). Hence, the 
average value (or mean value) of X obtained in the string of trials is, with high 
probability, equal to the expectation value (X), 

2 ^ 

lim —— - = {X) (A.41) 

N-*° c A 

an expression of the law of large numbers. The two terms, average and expectation 
value, are therefore often used synonymously. 

In quantum mechanics, as in classical physics, our information about the state 
of a system is often necessarily less than maximal and we must rely on statistical 
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ethods to describe the state. The entropy, which is a measure of disorder or ran- 
imness in statistical thermodynamics, is thus expected to be of even greater im- 
jrtance in quantum mechanics, since the intrinsically probabilistic nature of quan- 
m mechanics introduces novel features that have no counterpart in classical 
lysics. 2 * 

The number (A. 37) of distinguishable strings that correspond to the maximum 
? the multinomial probability distribution is 


(A; {N,), (N 2 ), . . . (AO) 


Nl 

(Np 1 )\(Np 2 y. . . . (Np n )\ 


(A.42) 


or large values of N, this is generally a very large number, the magnitude of which 
a measure of the degree of randomness of the original probability distribution, 
mce, being a probability, P(N; (A^), ( N 2 ), . . . , ( N n )) must be less than unity, we 
in derive an asymptotic estimate of (A.42) from (A. 38): 


N\ 


(N Pl )\(Np 2 y. . . . (N Pn y 


= pt Npi py NP2 


Pn 


N Pn 


s in statistical mechanics, it is preferable to express this quantity in terms of its 
igarithm: 


log(A; m, (N 2 ), . . . (N n )) « 


n 


-N X Pi log Pi 

i= 1 


It thus arrive at a quantity that is useful in characterizing the degree of randomness 
r disorder of a given probability distribution: 


H = ~X Pi ln Pi = -In 2 X Pi lo §2 Pi = -0.693 X Pi lo g 2 Pi 


(A.43) 


nown as the Shannon entropy, in analogy to the Boltzmann entropy, H can be 
iterpreted as an average measure of the information that is missing and that is 
ained when it is determined which of the distinguishable outcomes satisfying the 
andition X Pi = 1 occurs. The unit of the entropy H in (A.43) is called the nat. If 

i 

ie entropy is expressed as — X Pt l°g 2 Po the unit of information is the bit, which 


; equivalent to 0.693 nats. In the example of the die, the missing information is 

l 1 l\ 

X I — - log 2 - I = log 2 2 = 1 bit if the outcomes are “even” and “odd” rolls; 
is 6 X ( — y: log 2 = log 2 6 = 2.58 bits if the outcomes are the six different 


ossibilities of the roll of the die. Roughly, the value of the entropy in bits tells us 
ie minimum number of binary yes-no questions about the outcome of the exped- 
ient that an uninformed observer must ask to be assured of learning which event 
as occurred. The value of H in bits is also a lower bound for the average length of 
le sequence of binary digits required to code a set of n messages, if the ith message 
ccurs with probability p t . 


2 Jones (1979) is a useful book on basic information theory. Quantum information theory, as well 

s quantum coding, is discussed in Peres (1995). See also Schumacher (1995). 
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Exercise A.3. Devise a strategy of asking yes-no questions that guarantees 
that one can ascertain the outcome of the roll of the die, with six distinct possibilities, 
in three attempts or less. 

Exercise A.4. Compute the Shannon entropy in bits for the head-tail toss with 
an unbiased coin, and compare the result with the value of H for a slightly biased 
coin (p h eads — 0.48, p tails = 0.52). How many trials with the biased coin does it take 
to ensure that the average number of “heads” differs from the average number of 
“tails” by more than two root-mean-square deviations? 

A key property that characterizes the entropy, and that the Shannon definition 
(A. 43) satisfies, is that if two statistically independent probability distributions 
p'i and p'j are combined by multiplication into one joint probability distribution, 
Pij ~ P'iP'j » the corresponding entropies add: 

H = Pij In Pij = “2 P'iP'j ln p'p'j 

u u (A.44) 

= “2 P'i In p'i - S P"j ln P'j = H ' + H " 

i j 

The connection between the information entropy (A. 43) and the thermodynamic 
entropy of a physical system in equilibrium can be glimpsed if we inquire about the 
probability distribution which, given the known physical constraints on the system, 
maximizes the missing information. Here we merely treat a simple idealized model. 
The application to a more physically realistic situation is outlined in Section 22.5. 
Suppose that the system is constrained only by the expectation value (X) of a random 
variable X, which we assumfe to be known. We ask for the probabilities p t which 
maximize the entropy H, subject to the normalization condition 

2 Pt = 1 

i 

and the condition that (X) must take on the prescribed value. Using the method of 
Lagrangian multipliers, we must determine the extremum of H — k{X)\ 

8H - A 8(X) = 8pi ln /?, - 2 8 Pi ~ A 2 X i 8 Pt = 0 (A.45) 

i i i 

Because of the normalization, the variations of the probabilities must be related by 

S 8p t = 0 

i 

Thus, (A.45) gives the optimal probability distribution as 

Pi = Ce- xx > (A.46) 

The constants C and A are determined by the two constraint equations. The proba- 
bility distribution (A.46) has an uncanny similarity with the Boltzmann distribution 
for a canonical ensemble in statistical thermodynamics, if X denotes the energy and 
if the Lagrangian multiplier A is identified with the reciprocal temperature in appro- 
priate units. The reciprocal of C is known as the partition function. 

Exercise A.5. Show that in the absence of any constraint, the Shannon entropy 
H is maximized by the uniform probability distribution p x = p 2 — ■ ■ ■ — p„ — 1 In, 
if n is the number of accessible outcomes. (This is a model for the microcanonical 
ensemble of statistical mechanics.) 
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Exercise A. 6. Assuming that the expectation values of several random vari- 
>les are known, generalize the derivation of the probability distribution that max- 
lizes the entropy subject to these multiple constraints. (This is a model for the 
•and canonical ensemble of statistical mechanics.) 

In this section, all formulas were written specifically for discrete distributions, 
i most cases of interest in this book, the extension to continuous distributions, with 
tegrals replacing sums, is straightforward, provided that one employs an appro- 
bate density of outcomes (or density of states). From a practical point of view, we 
in in many cases, approximate continuous probability distributions by “discre- 
zed” ones, in which small ranges of outcomes are assigned to discrete “bins.” 
rch a procedure is natural in many physical settings and analogous to experimental 
chniques with finite resolution, which invariably involve collecting data over nar- 
»w ranges of continuous variables. The continuous angular distribution of particles 
;attered from a target into detectors with finite aperture illustrate this point. 

Curvilinear Coordinates. The symmetry of a problem often dictates the most 
ivantageous choice of coordinates. Spherical polar coordinates are convenient for 
roblems with rotational symmetry, and rectangular Cartesian coordinates are ap- 
ropriate for problems with cubic symmetry. Parabolic coordinates are convenient 
>r the Coulomb potential, even in the presence of a uniform electric field. Here we 
immarize the essential formulas for expressing the Schrodinger equation in arbi- 
ary curvilinear coordinates. 

The location, of a particle in space is specified by its generalized coordinates: 

q, = q t ( r) (* = 1, 2, 3) (A.47) 

he differential properties of the coordinates are conveniently characterized by the 
iements of the metric tensor g ik , which are the coefficients in the quadratic form 
lat represents the square of an infinitesimal displacement: 

4 

ds 2 = g ik dq l dq k (A.48) 

he summation convention is assumed to be in force: One must sum over repeated 
idices, if one of them is a subscript and the other a superscript. We assume that 
le space is Euclidean, or flat, so that it is possible to express the metric as 

ds 2 — (dxx ) 2 + (dx 2 ) 2 + (dx 3 ) 2 — dx 2 + dy 2 + dz 2 
l general coordinates, the Laplacian operator has the form 3 

V 2 = diVgg‘% (A.49) 

'here we have denoted the partial derivatives as 

di = (A.50) 

dqi 


3 For a proof see Bradbury (1984), Chapter 4, Section 10. 
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The quantity of g is the determinant of the metric tensor: 

8 = det g ik 


(A.51) 


General coordinates are orthogonal if and only if the metric tensor is diagonal. For 
orthogonal coordinates, 


, 8 — 811822833 

and the Laplacian (A.49) reduces to 
1 


V 2 = 


Vg 


d 822833 d ^ d 833811 j_ d 811822 d 

pqi \ gn d<li dq 2 V 822 dq 2 dq 2 "y g 33 dq 2 J 


(A.52) 


As an example, we derive the Laplacian for parabolic coordinates by a direct cal- 
culation and then compare the result with (A.52). 

Parabolic coordinates are defined by 


= Vjv cos <p y — v^n sin <p z = ^ (g - v) 


(A.53) 


Hence, 


= Vtq cos <p x + \/Jrt sin (p y + ^ (£ - rj) z 


(A.54) 


and 


1 




v 


1 




dr — - ( /— cos x + /— sin <p y + z + - I — cos <p x + — sin (p y — z 1 drj 


V 


V 


+ (~ Virj sin <p x + Vlrj cos <p y) <i<p 


^ rr ^ d ^ + \ ^ dv + '^ n Q d(p 


(A. 55) 


The last equation defines the orthogonal basis vectors that span the parabolic coor- 
dinate system. Since the gradient operator V is defined by 


d d d 

dr • V = dg — + dr] 1 -dip — 

d£ dr} dip 


we obtain 


V = € 2. 


* 




I + 77 af, 


£ + r? dr?, 


+ ip 


(A.56) 


1 4) (A.57) 




After some algebraic manipulations working out the partial derivatives of the basis 
vectors with respect to the parabolic coordinates, we obtain for the Laplacian, 


V 2 = V • V = 


— (e — 


+ 


£ + 1 ? df d£/ £ + 7 } dr} dr} 


1 d 2 

+ (A .58) 

£r} dip 
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a the other hand, in parabolic coordinates, the metric is found from (A. 55) to be 

ds 2 = dr ■ dr = - — ~r ~ d£, 2 + — A — — dr] 2 + £17 d<p 2 (A.59) 

4rj 

om which the metric tensor can be obtained and the Laplacian (A. 52) calculated, 
le result agrees with (A. 5 8). 

Exercise A. 7. Derive the gradient and Laplacian in spherical polar coordi- 
ites, using the techniques of this section. 

Units and Physical Constants. In line with common practice in introductory 
eoretical physics, in this book the Gaussian system is used for defining physical 
lantities. Table A.l shows how symbols for electromagnetic quantities in equations 
id formulas (but not their numerical values) are converted from the Gaussian sys- 
m to the Heaviside-Lorentz (or “rationalized” Gaussian) system, frequently used 
particle physics, and the symbols that underlie the SI system of units. 4 


able A.l 


Quantity 

Gaussian 

Heaviside-Lorentz 

SI 

harge 

0 

q or e 

q or e 

V47T V^7 T 

q III e 

V47re 0 V47re 0 

harge and current density 

p and j 

-~= and — 1= 
\Arr V47T 

P i|r] j 

V477s 0 V47re 0 

lectric field 

E 

V47T E 

V / 47ts 0 E 

lectrostatic potential 

<t> 

V 47 T 4> 

V47re 0 (j) 

ector potential 

A 

V477 A 

V47re 0 A 

[agnetic field 

B 

V477 B 

V47re 0 B 

[agnetic moment 


V47T 

[do 

peed of light 

c 

c 

1 

VeojU-o 

irita etmobiro nnnetont rv 


e 2 

e 2 

Inc structure cunsid.ni ex 

he 

47 The 

47re 0 ^c 


In Table A. 2, we list numerical values of important quantities and fundamental 
instants. 5 No rigid system of units is slavishly adhered to in this table, since the 
hoice is always suggested by the context in which the quantities are used. 

4 In Table A.l we adopt the format used in Jackson (1975), Appendix. 

5 The numbers in Table A. 2 are adapted from Cohen and Taylor (1996). 
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Table A.2 


c (speed of light) 

2.9979 • 10 8 m s" 1 

h 

6.6261 • 10“ 34 J s 

h = hi 2 tt (Planck’s constant) 

1.0546 • 10 -27 erg s 

6.5821 • 10“ 16 eV s 

e (electron charge) 

1.6022 • 10“ 19 C 

i 

m e (electron mass) 

9.1094 • 10" 31 kg 

0.51099 MeV/c 2 

h 

(Compton wavelength/2-Tr) 

3.8616 • 10~ 13 m 

m e c 

Jm e (neutron-electron mass ratio) 

1838.7 

u (atomic mass unit) 

1.6605 • 10- 24 kg 

h 2 

a = , (Bohr radius) 

0.52918 • 10“ 10 m 

m e e 

- (ground state energy of hydrogen atom) 

13.61 eV 

e 2 

a — — (fine structure constant) 
he 

7.2974 • 10 -3 - 1/137.035 

eh 

9.2740 • 10" 24 J T -1 

p 0 — (Bohr magneton) 

2 m e c 

9.2740 • 10 -21 erg/gauss 
5.788 • 10” 5 eV T -1 

k (Boltzmann’s constant) 

1.3807 • 10“ 23 J K" 1 
8.6174 • 10“ 5 eVK" 1 

eV (electron volt) 

1.6022 • 10" 19 J 


Often it is convenient to work with natural units, which are based on quanti- 
ties that are prominent in a particular subfield of physics. For example, atomic and 
condensed-matter physics problems are conveniently formulated in terms of natural 
units defined by setting 

h = 1, m e = 1, e = 1 (A.60) 

With this choice, the Bohr radius, a = h 2 /m e e 2 , becomes the unit of length, and 
2 E ls (27.21 eV) the unit of energy, known as the Hartree unit. Since the fine struc- 
ture constant is dimensionless, the speed of light in these natural atomic units (or 
a.u.) is c « 137 a.u. Since the proton mass is m P — 1836 a.u., the kinetic energy of 
a proton with velocity u — 1 a.u. is £ = 918 Hartree units, or about 25 keV. 

In particle physics, the preferred natural units are quite different, since one often 
works in the relativistic regime. A frequent choice is 

h= 1 c — 1 (A.61) 

and a third unit, usually an energy, such as 1 GeV. The value of e is now deter mined 
by the fine structure constant to be e — Va? in Gaussian units and e = ~ 0.3 

in Heaviside-Lorentz units. 
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Absorption of radiation, 491-501, 
577-579 

cross section for, 494-501, 510, 
514, 591 

by harmonic oscillator, 561 
rate of, 498 
selection rules for, 497 
sum rules for, 499 
Action function, classical, 355 
Action integral, see Phase integral 
Active transformation, 76, 201, 41 1 
Addition of angular momenta, 
426-431, 555-556 
Addition theorem for spherical 
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Additive one-particle operator, 544, 
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Additive two-particle operator, 545 
Adiabatic approximation, 161 
Adiabatic change, 151 
Adiabatic invariant, 151 
Adjoint Dirac field operator, 599 
Adjoint of an operator, 52-53, 192 
Aharonov-Bohm effect, 78 
Airy equation, 117,123 
Airy function, 117-119, 123, 139 
Alkali atoms, spectra of, 469 
Allowed bands, 166 
Allowed transitions, 496 
Almost degenerate levels, 

perturbation of, 145, 463 
Alpha decay, 133 
Amplitude, 8 
Analyzing power, 403 
Angular distribution, 281, 301 
Angular momentum, 233-255, 
414-439 

addition of, 426-431, 555-556 
commutation relations for, 234, 
238, 384, 413 

conservation of, 233, 330-331, 
375, 389-390, 414-415 
as constant of motion, 256, 389, 
414-415, 624 
coupling of, 426-431, 449, 
472-473, 555-556 
eigenvalue problem for, 238-248 
in Dirac theory, 604, 624 
Euler angles representation of, 
449-450 

as generator of infinitesimal 
rotations, 236, 382, 389, 

413, 449 

intrinsic, 372-377, 389. See also 
Spin 

and kinetic energy, 252-255 
in many-particle system, 

555-556 

orbital, 233-237, 425, 443 
parity of eigenstates, 249, 44 Q 
of photons, 569-570, 575-576 
of rigid bodies, 449-450 
rotations and, 236, 381-385, 
413-416, 603-604 


superselection rule for, 414 
and time reversal, 443 
total, 389-390, 416, 426, 
469-470, 555, 604, 624 
eigenstate of, 430 
for two identical particles, 
555-556 

Angular momentum operator, 238, 
413 

Angular momentum quantum 
number, 239, 245, 390, 
422-423, 428-429, 555, 627 
Anharmonic oscillator, 177 
Anisotropic harmonic oscillator, 
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Annihilation operator, 538, 

543-544, 567. See also 
Lowering operator 
A-normalization, 61 
Anticommutation relations for 
fermions, 540-543, 590 
for Dirac field, 598 
Antilinear operator, 34, 188, 192, 
412. See also Time reversal 
operator 

Antisymmetry of fermion wave 
function, 547 

Antiunitary operator, see Time 
reversal operator 

Associated Laguerre polynomials, 
271 

Associated Legendre functions, 247 
Asymmetry in scattering, right-left, 
376-377, 403 
Asymptotic expansion: 
of Airy function, 117-119 
of Coulomb wave function, 312 
of parabolic cylinder function, 
157 

of plane wave function, 262 
of scattering eigenfunction, 287, 
294, 298-300 

of spherical cylinder function, 
259-260 

and WKB approximation, 120 
Auger transition, see Radiationless 
transition 

Autoionizing transition, see 
Radiationless transition 
Average value of random variable, 
635 

Axial vector operator, 440 
Axial (pseudo-)vector operator in 
Dirac theory, 605 
Axis of quantization, 244, 423 

Balmer formula, 2, 267, 269, 628 
Bands, allowed and forbidden, in 
periodic potential, 166-178, 

481 

Band theory of solids, 166 
Barrier penetration, 97-98, 

125-133, 150, see also 
Tunneling 

Basis functions, 139 


Basis states, 186, 379, 537 
Bell, John S., 9, 362 
Bell’s theorem, 18, 362 
Berry’s connection and geometric 
phase, 162 

Bessel functions, spherical, 258 
Bessel’s equation, 258 
Bessel’s inequality, 58 
Black-body radiation, 590 
Bloch function, 71, 168, 295 
Bohm, David, 29 
Bohr, Niels, 1-10, 18 
Bohr frequency condition, 1, 20, 
396, 492 

Bohr magneton, 374 

Bohr radius of hydrogen, 267, 641 

Boltzmann distribution, 91 

Boltzmann statistics, 566 

Boost, 75, 607 

Born, Max, 7 

Born approximation, 295-298, 314, 
526, 534 

Born interpretation, see Probability 
interpretation, 

Born-Oppenheimer approximation, 
161 

Born series, 526 
Bose-Einstein: 

commutation relations, 543, 546 
condensation, 11 
statistics, 543, 566 
Bosons, 543 

Boundary conditions, 263-265 
at fixed points, 66. See also 
Periodic boundary 
conditions 

at infinity, 43-45, 82, 104, 262, 
265, 626 

at origin, 264, 299, 626-627 
Bounded Hermitian operator, 
212-214 

Bound states, 83, 103-108, 

121-125, 262-263, 265-277 
Bra, 196 

Bragg reflection, 176 
Bra-ket, 205 

Breit-Wigner resonance, 130, 306 
Brillouin, Ldon, 113 
Brillouin’s theorem, 562 
Brillouin-Wigner perturbation 
theory, 467 

Brillouin zone, 70-71, 167, 172. 

See also Dispersion function 

Campbell-Baker-Hausdorff 
formula, 40 

Canonical coordinates and 
momenta, 326 

Canonical quantization, 326-332 
Canonical transformation, 329 
Casimir (van der Waals) forces, 

574 

Casimir operator, to label 

irreducible representations, 
422 



48 


Index 


asimir projection operator, 608 
auchy principal value, 293, 513, 
632 

ausality, principle of, 315, 319 
ayley-Hamilton theorem, 212 
entral forces, 256-275, 623-628 
scattering from, 298-302, 
530-532 

entrifugal potential, 257 
-G coefficient, see Clebsch- 
Gordan coefficients 
hange of basis, 199-202, 

538-542 
haotic state, 589 
haracteristic equation, see 
Determinental equation; 
Secular equation 
haracteristic value, 140. See also 
Eigenvalue 

harge conjugation, 408, 608-610 
hemical bond, 164 
hemical potential, 565 
hirality, 439, 620 
hiral solutions of Dirac equation, 
622 

Ihiral symmetry, 620, 622 
lassical approximation, 3, 123. 

See also Correspondence 
principle; WKB 
approximation 

'lassical dynamics, relation to 
quantum dynamics, 324 
Classically accessible region, 116 
Classical turning point, 116 
Clebsch-Gordan coefficients, 

427 

orthonormality relations, 429 
recursion relation, 428, 432 
selection rules, 428 
symmetry relations, 429, 436 
triangular condition, 428 
Clebsch-Gordan series, 431-432 
Closely coupled states, 486-487 
Closure relation, 62-63, 67-68, 

198, 529, 597 

Coherence, of field, 583-586 
Coherent state(s), 220, 225-231, 
583 

displacement operator for, 225 
generated by current-field 
interaction, 581-582 
and Heisenberg uncertainty 
relation, 229 
inner product of, 226 
overcompleteness relation for, 
227, 365 

relation to number operator 
eigenstates, 227 
representation by entire 
functions, 228 
rotation operator for, 226 
time development of, 335, 
340-342, 353 

Hoherent superposition, 301 
^Coincidence rate, 588-589 
[Collapse of wave packet, 408 
[Commuting Hermitian operators, 
214-217, 407 
[Commutation relations: 
for angular momentum, 234, 238, 
384, 413 

for bosons, 540-543 
for canonically conjugate 
variables, 326, 332 


for coordinates and momenta, 38, 
204, 325, 344 

for creation and annihilation 
operators, 540-543 
at different tiroes, 332 
for electromagnetic field, 576 
for field operators, 546 
for photon operators, 569 
for tensor operators, 436 
for vector operators, 236, 384 
Commutator, 37, 38-41, 218, 326, 
332 

Compatibility of observables, 180, 
407 

Complementarity, 4 
Completeness: 

of Dirac matrices, 605-606 
of dynamical variables, 372 
of eigenfunctions, 46, 57, 206, 
350 

of eigenvectors, 180, 198, 
214-217, 529 

of Hermite polynomials, 88 
of spherical harmonics, 249 
Completeness relation, 59, 206, 

364 

Complete set: 

of basis vectors, 209 
of commuting Hermitian 
operators, 180, 216 
of eigenfunctions, 59 
of functions, 142 
of observables, 180 
of vectors, 186, 217 
Complex potential, 78 
Complex vector space, 185 
two-dimensional, 377-381 
Complex wave function, 13 
Composition rule for amplitudes, 
182, 315 

Configuration, 559 
Configuration space, 359, 547 
Confluent hypergeometric equation, 
270 

Confluent hypergeometric 

functions, 156, 270, 311 
Connection, for gauge field, 162, 
447 

Connection formulas, WKB 

approximation, 116-121 
Conservation: 

of angular momentum, 233, 
330-331, 375, 389-390, 
414-415 
of charge, 600 
of current, 600, 623 
of energy, 38, 43, 321, 503, 509 
of linear momentum, 38, 330 
of parity, 441, 460 
of probability, 26-28, 42, 94-95, 
100, 121, 318, 391, 514 
Conservation laws, see Constant of 
the motion 

Constant of the motion, 37, 319, 
330, 415 

Continuity equation, 26, 28, 36, 74, 
599-600 

Continuous spectrum, 44, 60-62, 
94, 181, 202-206, 284, 546, 
592 

Continuum eigenfunctions, 60-62 
for Coulomb field, 310-312 
Contraction: 

of two vectors, see Inner product 


of irreducible spherical tensor 
operators, 436 

Convection current density, 623 
Convergence in the mean, 58, 142 
Coordinate operator, matrix 
element of: 

in coordinate representation, 204 
in momentum representation, 346 
in oscillator energy 

representation, 88, 224 
Coordinate representation, 32, 204, 
344-348 

wave function, 180, 345 
Copenhagen interpretation, 18 
Correlated state, 406, 552 
Correlated wave function, 361 
Correlation function, for field 
operators, 585-586 
Correspondence principle, 3, 37, 
324 

Coulomb barrier, 127 
Coulomb excitation, 487-491 
Coulomb gauge for radiation field, 
491, 572-573, 582 
Coulomb interaction between 
identical particles, 553 
Coulomb potential, 265-275, 625. 

See also Hydrogen atom 
Coulomb scattering, 310-312. See 
also Rutherford scattering 
cross section 
Counting rate, 280 
Coupled harmonic oscillators, 371, 
480, 568 

Covalent bond, 164 
Covariance, 635 

Creation operator, 538. See also 
Raising operator 
Crossing of energy eigenvalues, 
465-466 

Cross section, 278-286 
absorption, 494-501, 510, 514, 
591 

Coulomb scattering, see 

Rutherford scattering cross 
* section 

differential scattering, 281, 290, 
301, 312, 401, 520, 525 
partial, 301, 304-306 
photoemission, 502 
resonance, 306 
total scattering, 281, 290, 
301-302 

Current density, 26, 553, 600, 610, 
623. See also Probability 
current density 
gauge invariant, 74 
Current distribution, interaction 
with photon field, 580- 
582 

Curvilinear coordinates, 638-640 

de Broglie, Louis, 2 
de Broglie relation, 2, 12 
de Broglie wavelength, 2, 115 
Debye potential, 277, see also 

Screened Coulomb potential; 
Yukawa potential 

Decaying state, 132-133, 307, 392, 
514. See also Exponential 
decay 

Degeneracy of energy eigenvalues, 
44-45, 144, 207. See also 
Repeated eigenvalues 
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absence of, in linear harmonic 
oscillator, 83 
connection with group 

representation, 419-420 
for free particle, 65-66 
for hydrogen atom, 267-270 
for isotropic harmonic oscillator, 
276 

for periodic potential, 176 
removal of, by perturbation, 
144-146 3 

Degenerate perturbation theory, 
463-467 

for periodic potential at band 
edges, 481 

Delta function, 630-634 
partial wave expansion of, 252 
for solid angle, 252 
Delta function normalization, 61 
Delta function potential, 107-108, 
206 

A-variation, 476 
6-variation, 474-475 
de Moivre formula, spin 

generalization of, 387 
Density matrix: 
spin one-half, 392-399 
photons, 587-589 
Density of states, 62, 65-67, 501, 
504, 578 

Density operator, 319, 322, 363, 
370 

for chaotic state, 589 
for thermodynamic equilibrium, 
564-565 

Detailed balancing, 493 
Determinantal equation, 140-141. 

See also Secular equation 
Deuteron, energy and wave 
function, 275 

Diagonalization of normal matrix, 
209-211 

in degenerate perturbation 
theory, 463 

Diamagnetic susceptibility, 481 
Diatomic molecule, model of, 112, 
163-165 

Dichotomic variable, 378 
Differential cross section, see 
Cross section 

Dipole moment, see Electric dipole 
moment. Magnetic moment 
Dirac, Paul A. M., 196, 594 
Dirac equation: 
for electron, 469, 596-600 
for free particle field, 606-608 
Dirac field, 594 
adjoint, 599 
Dirac Hamiltonian, 596 
Dirac matrices: 
a and j3 matrices, 597 
y matrices, 598-599 
physical interpretation, 617-620 
standard representation, 597, 603 
Dirac picture, see Interaction 
picture 

Dirac spinor, 595 
Direct integral, 479, 553, 558 
Direct product: 
of irreducible representations^, 
431-432 

of matrices, 358, 431 
of vector spaces, 358, 426, 
430-431 


Discrete spectrum, 43-44, 83, 181 
Dispersion function, 166 
extended-zone scheme, 172 
reduced-zone scheme, 172. See 
also Brillouin zone 
repeated-zone scheme, 167, 173 
Displacement operator, 68-71, 165, 
225 

eigenvalue problem of, 70 
Distorted plane wave, 300 
Double oscillator, 149-159 
Double scattering, 376-377, 403 
Double-slit interference, see Two- 
slit interference 

Double-valued representations of 
rotation, 387, 424 
Driven harmonic oscillator, see 

Forced harmonic oscillator 
D(R)-matrix for rotations, 423, see 
also Rotation matrix 
Dual vector space, 196 
Dynamical variable, 38, 53, 57 

Ehrenfest’s theorem, 36-37 
Eigenfrequency, 5 
Eigenfunction, 42,54 
Eigenket, 198 
Eigenstate, 54 

Eigenvalue, 42, 54, 140, 198 
Eigenvalue problem for normal 
operator, 207-214 
Eigenvector, 198 
Einstein, Albert, 2 
Einstein A coefficient, 580 
Einstein, Podolsky, Rosen (EPR), 
361-362 

Einstein principle of relativity, 600 
Elastic scattering, 284-286, 518 
of alpha particles, 284-286 
of electrons by hydrogen atoms, 
534 

Electric dipole moment, 441, 
459-463 

matrix element of, 88, 489, 496, 
579 

induced, 461 
permanent, 461 

Electric dipole (El) transition, 489, 
496, 579 

selection rules for, 436, 441, 497 
Electric multipole expansion, 437, 
see also Multipole expansion 
of Coulomb interaction 
Electric multipole (2€) moment, 
437, 488 

parity and time reversal selection 
rules, 444 

Electric quadrupole transition, 516 
Electromagnetic field, quantization 
of, 569-573 

Electron, relativistic theory of, 
592-629 

in Coulomb field, 625 
in magnetic field, 620, 629 
Electron-positron field, 592 
charge of, 593-596 
energy of, 593-596 
momentum of, 593-595 
Electrostatic polarization of atoms, 
459-463 

Elementary excitation, see 

Quantum; Quasiparticle 
Emission probability, 577-579, 
581-582 


Energy bands, allowed and 
forbidden, in periodic 
potential, 166-178, 481 
Energy eigenvalues, 42-44 
of configuration p 2 , 568 
for delta function potential, 
107-108, 208 

for double oscillator, 153-155 
for exponential well (5 states), 
275 

for free particle: 
nonrelativistic, 63 
periodic boundary conditions, 
64-66 

relativistic, 592, 606 
for harmonic oscillator, 83 
for helium atom, 477-480, 505, 
560 

for hydrogen(ic) atom: 
nonrelativistic, 267, 269 
nuclear size effect, 277 
relativistic, 627-628 
for linear potential, 124 
for particle in a box: 

one dimensional, 106-107 
three dimensional, 66 
for positronium, 274-275 
for scattering states, 522 
for square well: 
one dimensional, 105-106 
three dimensional, 262-263 
Energy gap, for periodic potential, 
168, 172, 481 
Energy level crossing, see 
Crossing of energy 
eigenvalues 

Energy levels, see Energy 
eigenvalues 

Energy normalization, 63-64 
Energy operator, 35, 54 
Energy representation, 206, 334 
Energy shell, 531 
Energy shift, 144, 452, 476, 522 
Energy transfer, average: 
in inelastic process, 486, 
489-490 

in absorption from light pulse, 
494 

Ensemble, see Statistical ensemble 
Entangled state, 362, 406 
Entropy, 636. See also Outcome 
entropy, von Neumann 
entropy, Shannon entropy 
Equation of motion: 
for density matrix, 395 
for density operator, 319, 322, 
369 

for expectation value, 37, 319 
in integral form, 338 
for operator, 321-322, 
in second quantization, 550- 
551 

for spin state, 390-392 
for state vector, 317, 482 
for wave function, 37, 41, 348, 
615-616 

Equivalence transformation, 418 
Equivalent representations, 418 
Euclidean principle of relativity, 
410 

Euler angles, 242, 424, 449-450 
Euler-Lagrange equation, 135 
Euler’s theorem for homogeneous 
functions, 48 
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(change, 154-155, 477-479, 553, 
562 

(change degeneracy, 477 
(change integral, 479, 558 
(cited states, 44 
cciton, 222 

(elusion principle, see Pauli 
exclusion principle 
(pansion of wave packet, in terms 
of: 

coherent states, 227 
Dirac wave functions, 616 
eigenfunctions of an observable, 
60-62 

momentum eigenfunctions, 15, 

19, 30, 63, 286 
oscillator eigenfunctions, 89 
scattering eigenfunctions, 287, 
294 

spherical harmonics, 249 
xpansion postulate, 46, 59 
xpectation value, 29, 198, 634 
of Dirac matrices, 619-620 
in Dirac theory, 615 
equation of motion for, 37, 319 
of function of position and 
momentum, 33, 35 
of observable, 198 
of operator in spin space, 393 
of position and momentum, in 
coordinate and momentum 
space, 29-32 

xponential decay, 133, 510-513 
xponential operator, 39-41 
xponential potential, 275 
xtended Euclidean principle of 
relativity, 439 

ermi, Enrico, 507 
ermi-Dirac: 

anticommutation relations, 543, 
547 

statistics, 543, 566 
ermi gas, 567 
ermions, 543 
eynman, Richard F., 385 
eynman path integral, 355-357 
ield operators, 546, 551 
bilinear functions of, 605 
ine structure, of one-electron 
atom, 470, 627-628 
ine structure constant, 471-472, 
641 

lavor, quantum number, 536 
loquet’s theorem, 166 
ock space, 537 

oldy-Wouthuysen transformation, 
618 

orbidden energy gap, 168, 172, 

481 

orbidden transitions, 496-497 
orced harmonic oscillator, 
335-342, 354, 486, 580 
orm factor for scattering from 
finite nucleus, 534 
orward scattering amplitude, 302, 
533 

ourier analysis, 15, 64, 630 
ranck-Hertz experiment, 1 
ree energy, generalized, 565 
ree particle eigenfunctions: 
nonrelativistic, 44, 62-65 
normalization of, 63 
with sharp angular momentum, 
257-262 


relativistic, 606-609 
Free particle motion: 
one dimensional, 22-24 
propagator for, 35 1 
relativistic, 618-619 
Functional integral, 357 
Function of a normal operator, 212, 
217 

Galilean transformation, 5-6, 

75-78 

Gamow factor, 313 
Gauge, 73 
field, 75, 445 
invariance, 71-75, 347 
natural, 75 
principle, 599 
symmetry, 75, 444-447 
theory, electroweak, 538 
Gauge transformation: 
global, 73 

local, 73, 347, 444-447 
Generalized free energy, 565 
Generating function: 
for associated Laguerre 
polynomials, 271 
for Hermite polynomials, 85-86 
for Legendre polynomials, 
246-247 
Generator: 

of infinitesimal rotations, 236, 
330-331, 382, 449 
of infinitesimal transformations, 
328 

of infinitesimal translations, 
70-71, 236, 330 
g-factor: 
for atom, 439 
for electron, 620-621 
for nucleus, 449 
Golden rule of time-dependent 
perturbation theory, 

503-510 

Gordon decomposition of Dirac 
current, 623 

Good quantum number, 473 
Goudsmit, Sam, 374 
Grand canonical ensemble, 565 
Grand canonical potential, 565 
Grand partition function, 565 
Green’s function, 290, 349, 454 
advanced, 293, 337, 350 
for harmonic oscillator, 337 
incoming, 293 
outgoing, 293 

partial wave expansion of, 308 
in perturbation theory, 457-458 
retarded, 293, 337, 349-350 
in scattering theory, 290-295, 
523-524 

standing wave, 293 
for wave equation, 349-350 
Green’s operator in scattering, 
523-524 

Ground state, 44, 84, 222 
variational determination of, 136, 
213 

Group, definition, 69, 416-417 
Group representation, 416-421 
Group velocity, 19, 175 
Gyromagnetic ratio, 374, 398. See 
also g-factor 

Hamiltonian, for particle in 

electromagnetic field, 72 


Hamiltonian operator, 37, 317, 348 
Hamilton-Jacobi equation, 23, 25, 
114, 352 

Hamilton’s equations, 80, 324 
Hamilton’s principal function, 
23-24, 354-355 
Hamilton’s principle, 355, 357 
Hankel functions, spherical, 260 
Hard sphere scattering phase shift, 
303 

Harmonic oscillator, 79-89, 205, 
220-225 

coupled, 371, 480 
in Hartree-Fock theory, 568 
density operator for, 590 
eigenfunctions, 47, 83-89, 224 
recursion relations for, 81-82 
energy eigenvalues, 2, 83, 125 
in Heisenberg picture, 333 
in momentum representation, 34, 
47 

propagator for, 352 
and reflection operator, 440 
in thermal equilibrium, 91 
three-dimensional isotropic, 276, 
480 

and time development of wave 
packet, 49 

two-dimensional isotropic, 276, 
567 

and uncertainties in x and p, 49 
WKB approximation for, 125 
zero-point energy for, 84 
Hartree-Fock equations, 562 
Hartree-Fock method, 560-564 
Hartree units, 641 
Heaviside step function, 93, 342, 
633 

Heisenberg, Werner, 18 
Heisenberg (spin) Hamiltonian, 

567 

Heisenberg picture, 320, 550 
applied to harmonic oscillator, 
333-335 

and canonical quantization, 

* 321-322 

for Dirac field, 598 
in one-particle Dirac theory, 
617-621 

Heisenberg uncertainty principle, 

18 

Heisenberg uncertainty relations, 
20-22, 217-220 
for angular momentum 
components, 240 
for energy and time, 21-22, 43 
for position and momentum, 
14-18, 20, 219, 229, 
231-232 

in second quantization, 553 
Helicity, 449, 569, 576 
Helium atom: 

energy levels, 477-480, 505 
stationary states, 477, 560 
Hellmann-Feynman theorem, 175, 
178, 465, 476 

Hermite polynomials, 84-86 
completeness of, 88 
differential equation for, 84 
generating function for, 85 
integral representation for, 88 
normalization of, 87 
orthogonality of, 87 
recurrence relation for, 84, 
224-225 
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Hermitian adjoint operator, 197. 

See also Adjoint of an 
operator 

Hermitian conjugate (adjoint) of a 
matrix, 100, 192, 380 
Hermitian matrix, 193, 380 
Hermitian operator(s), 51-56, 192. 
See also Normal operator(s) 
eigenvalue problem for, 54, 
212-214 

as observables, 53, 179-180 
Hermitian scalar product, see Inner 
product 

Hidden variables, 9, 18 
Hilbert space, 185 
Hindered rotation, 158, 481 
Hole state in shell model, 567 
Hole theory and positrons, 618 
Holonomy, 447. See also Berry’s 
phase 

Hydrogen(ic) atom, 265-275, 623-628 
degeneracy in, 267, 468-469, 

628 

effect of electric field on, 

459-460, 467-469, 
eigenfunctions of, 270-275 
recursion relations for, 266 
emission of light from, 580 
energy levels of, 267, 269, 627-628 
fine structure of, 627 
lifetime of excited state, 580 
linear Stark effect, 467-469 
in momentum space, 502 
parity in Dirac theory of, 624 
reduced mass effect in spectrum, 
274 

relativistic correction to kinetic 
energy, 481 

and rotational symmetry in four 
dimensions, 268-270 
and WKB method, 275 

Ideal experiment (measurement), 

406, 408, 515 

Ideal gas, in quantum statistics, 

565 

Idempotent operator (matrix), 69, 

189, 394 

Identical particles, 535 
quantum dynamics of, 549-552 
and symmetry of wave function, 
547 

Identity operator, 189 

Impact parameter, 282, 488-489 

Impulsive change of Hamiltonian, 

342 

Impulsive measuring interaction, 

408 

Incoherent sum, partial wave cross 
sections, 301 

Incoming spherical waves in 
asymptotic scattering 
eigenfunctions, 294, 502, 

524 

Incoming wave, 100 
Incoming wave Green’s functions, 
293, 524 

Incompatibility of simultaneous 
values of noncommuting 
observables, 53, 407 
Indenumerable set of basis vectors, 
202 

Independent particle 

approximation, 559, 

560-561, 564 


Indeterminacy principle, see 
Heisenberg uncertainty 
principle 

Indistinguishability of identical 
particles, 535-538 
Induced electric dipole moment, 
see Electric dipole moment 
Infinitely deep well, 106-107, 275 
Infinitely high potential barrier, 95 
Infinitesimal displacement, 234 
Infinitesimal rotations, 235, 330, 
382 

representation of, 423 
Infinitesimal transformations, 328 
Infinitesimal translations, 70, 236, 
330 

Inflection point of wave function, 
94 

Information, 636 
in quantum mechanics, 363-370, 
403-408 

Infrared vibration, of oscillator, 

159 

Inhomogeneous linear equation, 
453-455 
Inner product: 
bra-ket notation for, 196 
of functions, 59 
of vectors, 187 
“In” states in scattering, 518 
Integral equation: 
for radial wave function, 309 
for scattering state, 293, 521-525 
for stationary state wave 
function, 291 
for time-dependent wave 
function, 549 

for time development operator, 
338 

Interacting fields, 577 
Interaction between states or 

energy levels, 146, 167, 178 
Interaction picture, 323, 483 
Interference, 8, 12-13, 184-185, 
357, 583-589 
Intermediate states, 509 
Internal conversion, see 

Radiationless transition 
Interpretation of quantum 
mechanics: 

Copenhagen, 18 
ontological, 9, 29 
realistic, 9 

statistical, 25-29, 408 
Interval rule, 470 
Intrinsic angular momentum, 

372-377, 389. See also Spin 
Invariance: 

under canonical transformations, 

329- 330 ; 

under charge conjugation, 
609-610 

under CP transformations, 409 
under gauge transformations, 
71-75, 347, 445 
under Lorentz transformations, 
600-605 

under reflections, 81, 101, 441, 
460, 605 

under rotations, 233, 269-270, 

330- 331, 383, 390, 414 , 
530-532 

under time reversal, 46, 100, 

167, 442, 612 

under translations, 165, 330 


Invariant subspace, 419 
Inverse of an operator, 69, 194 
Inversion of coordinates, 249, 439, 
605 

Irreducible representations 
(“irreps”), 418, 421 
of rotation group, 423 
of translation group, 166 
Irreducible spherical tensor 
operator, 434 

commutation relations for, 436 
selection rules for, 435-436 
time reversal properties of, 
443-444 

Isobaric spin, see Isospin 
Isometric mapping, 411 
Isometric operator W, in scattering, 
530 

Isospin, 445, 536 

Joining conditions for wave 

function, 46, 95, 104, 157 
Jones vector, 395 

Kernel for Schrodinger equation, 
349. See also Green’s 
function 
Ket, 196 

Kinetic energy operator, 35 
and orbital angular momentum, 
252-255 

Klein-Gordon equation, 621 
k-Normalization, 63 
Koopman’s theorem, 563 
Kramers, Hendrik A., 113 
Kramers degeneracy, 442, 612 
Kronig-Penney potential, 168-169, 
481 

Kummer function, 156 

Ladder method: 
for angular momentum, 239 
for harmonic oscillator, 221 
Lagrangian multiplier, 136 
Laguerre polynomials, associated, 
271 

and confluent hypergeometric 
functions, 270-271 
Lamb shift, 628 
Lande g-factor, see g-factor 
Lande’s interval rule, 470 
Laplacian, in curvilinear 
coordinates, 639 
Larmor precession, 398 
Laser, as two-level system, 500 
Laue condition, 295 
Law of large numbers, 635 
Legendre’s differential equation, 
244 

Legendre polynomial expansion see 
Partial wave expansion 
Legendre polynomials, 245 
completeness of, 249 
generating function for, 246 
normalization of, 246 
orthogonality of, 246 
recurrence fromula for, 247 
recursion relation for, 345 
Levi-Civita (antisymmetric) tensor 
symbol, 252 

Lie group, semisimple, 421-422 
Lifetime of decaying state, 133, 
307-308, 513-514, 580 
Light pulse, and its absorption, 
492-494 
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ight quantum, see Photons 
inear displacement operator, see 
Translation operator 
inear harmonic oscillator, see 
Harmonic Oscillator 
inear independence, 1 86 
of eigenfunctions, 55 
inear momentum, see Momentum 
inear operator, 34, 188, 412 
inear potential, 123 
energy eigenvalues for, 124 
ground state wave function for, 
139 

variational estimate for ground 
state of, 138 

and WKB approximation, 139 
inear Stark effect, 467-469 
inear vector space, see Vector 
space 

ine broadening, 500 
ine shape, 514 

ippmann-Schwinger equation, 522 
ocal interaction, of identical 
particles, 549 

ogarithmic derivative of wave 
function, 105, 302, 304 
ongitudinal polarization of 
particle with spin, 400 
orentz boost, 607 
orentz equation, 618 
orentz group, 600 
orentz transformation, 601 
infinitesimal, 602 
proper orthochronous, 601 
owering operator, 221. See also 
Annihilation operator 
eigenvalue problem for 225 
-S coupling, 559 ' 

-uminosity, 279-280 

ladelung flow, 28 
lagnetic moment, 372-375, 438 
of atom, 438-439 
of electron, 374, 388, 620, 622 
of nucleus, 449 
lagnetic quantum number, 244 
lagnetic resonance, 399 
laser, as two-level system, 500 
latching conditions, see Joining 
conditions for wave function 
latrix element(s): 
of operator, 191, 198 
in coordinate representation, 
204, 345 

in oscillator energy 
representation, 88, 223-224 
latrix mechanics, 142 
latrix methods, for transmission 
and reflection in one 
dimension, 97-99, 108-109 
latter waves, 2 

laxwell-Boltzmann statistics, 566 
laxwell equations, 573 
leasurement of observables, 53, 
57, 364, 370, 403-408 
ideal, of first kind, 408 
lehler’s formula for Hermite 
polynomials, 89, 353 
linimum uncertainty (product) 
state (wave packet), 220, 
229-230, 232 

time development of, 333,351 
lixing entropy, see Shannon 
entropy 


Mixture of states, 365, 399 
Mode(s), of elastic medium or 
field, 4, 569, 584 
Momentum eigenfunction, 62-65 
partial wave expansion of, 261 
Momentum: 
canonical, 72 

expectation value of, 32, 36, 90 
kinetic, gauge invariant, 74 
local, 115 

of photon field, 574-575 
radial, 255 

Momentum operator, 35, 62, 71, 204 
matrix element of, 205 
Momentum representation, 30-33 
and equation of motion, 30-31, 
347-348 

for harmonic oscillator, 34, 47, 
329 

for hydrogen atom, 502 
wave function in, 180, 345 
Momentum space, see Momentum 
representation 
Momentum transfer, 296 
Multinomial distribution, 635 
Multiple scattering, 286 
Multiplet, of spectral lines, 420 
Multipole expansion, of Coulomb 
interaction, 308, 488, 507, 

568 

Nats, 367 
Natural units, 641 
Negative energy states in Dirac 
electron theory, 616 
Negative frequency part of field, 

572, 594 

Neumann, John von, 52 
Neumann functions, spherical, 259 
Neutral kaon, decay of, 408-409 
Neutrino, 629 

Nodal line, defining Euler angles, 
424 

Nodes: 

as adiabatic invariants, 151 
of oscillator eigenfunctions, 87 
of hydrogen atom eigenfunctions, 
274 

of square well eigenfunctions, 

106 

of WKB bound state wave 
function, 122 

Noncrossing of energy levels, 465 
Nonorthogonal basis functions, 
146-149 

Nonorthogonal projection 

operators, for generalized 
measurement, 364-365 
Nonrelativistic limit of Dirac 
theory, 622 

No-particle state, 222, 537 
Norm, of state vector, 59, 187 
Normalization, 27-28, 57, 187 
of associated Laguerre functions, 
270 

of associated Legendre functions, 
247 

of coherent states, 225 
of continuum eigenfunctions, 61, 
203 

of Coulomb eigenfunctions, 313 
of free particle eigenfunctions, 
62-65 

of hydrogen eigenfunctions, 270 


of identical particle states, 556 
of Legendre polynomials, 246 
of momentum space wave 
functions, 31 

of oscillator eigenfunctions, 87 
of perturbation eigenvectors, 
456-457 

of radial eigenfunctions, 263 
in continuum, 300 
of scattering states, 527 
of spherical harmonics, 249 
of spinors, 393 
Normal operator, 195 

eigenvalue problem of, 207-211 
Normal ordering of operators, 228, 
558 

Null vector, 1 87 

Number of particles operator, 83, 
222, see also Occupation 
number operator 

0(n), orthogonal group, 421 
Observables, 59, 180 
commuting and compatible, 
214-217, 407 
complete set of, 180, 216 
simultaneously measurable, 180, 
214-217 

Occupation number operator, 537, 
542 

Old quantum theory, 241 
One-electron atoms, spectra of, 
469-471 

One-electron state(s), relativistic, 
613-614 
One-form, 196 

One-particle operator, additive, 
544-545, 615 
Opacity of barrier, 127 
Operators, 34-38, 188-195 
algebra of, 38-41 
Optical potential, 27 
Optical theorem, 103, 112, 302, 
532-533 

Orbital angular momentum, 

233-255, 425-426, 443 
eigenvalues: 

of component of, 242-244 
of magnitude of, 244-245 
Orbital angular momentum 
quantum number, 245 
Ordering: 

of noncommuting operators, 33, 
325 

normal, 228, 558 
time, 338, 484 
Orthogonality: 

of continuum eigenfunctions, 61 
of eigenfunctions of Hermitian 
operators, 55 
of eigenvectors of normal 
operators, 208-209 
of scattering states, 527 
qf spinors, 379 
of state vectors, 187 
of stationary states, 43 
Orthohelium, 480, 560 
Orthonormality, 56, 187 
Orthonormal set, basis vectors, 55, 
187, 201, 537 

Oscillator, see Harmonic Oscillator 
Oscillator strength, 488 
Outcome entropy, 368, 404-405 
Outer product, see Direct product 



Index 


653 


Outgoing spherical waves in 
asymptotic scattering 
eigenfunctions, 287, 294, 
502, 523 

Outgoing wave, 100 
Outgoing wave Green’s function, 
293, 523 

“Out” states in scattering, 518 
Overcomplete set of coherent 
states, 227, 365 
Overlap integral, 147, ,153 

Pair, electron-positron, annihilation 
of, 616 

Pair density operator, 567 
Pair distribution operator, 545 
Pair state, 556 

Parabolic coordinates, 310, 462, 
639 

Parabolic cylinder functions, 
156-157 

Parahelium, 480, 560 
Parity, 81, 440 
and angular momentum, 249 
conservation of, 441, 460 
in Dirac theory, 605, 610-611 
and electric dipole moment, 441, 
460 

nonconservation of, 441 
operator, 249, 441, 605 
selection rules, 441 
in spin space, 440 
Parseval’s equality, 59 
Partial wave cross section, 301 
Partial wave expansion: 
of delta function, 252 
of Green’s function, 308 
of plane wave, 261 
of scattering amplitude, 301, 531 
of S matrix, 531 

Particle-antiparticle transformation, 
408, 608-610 

Particle-antiparticle oscillation, 

409 

Particle density operator, 553, see 
also Probability density 
operator 

Particle in a box, 66-67 
Partition function, 637 
Passive transformation, 76-77, 

201, 602 

Pauli exclusion principle, 543 
Pauli spin matrices, 386, 603 
PCT theorem, 613 
Penetrability, 128 

Penetration of potential barrier, see 
Barrier penetration 
Periodic boundary conditions, 45, 
64-66, 107 

Periodic potential, 156-176 
eigenvalue problem for, 168-173 
perturbation theory for, 48 1 
Perturbation, 128 
Perturbation expansion, 452, 475 
arbitrary constants in, 456-457 
to first order, 452-453, 455-459 
to second order, 456-459, 
461-462 

Perturbation theory, 142-146, 
451-459 

for degenerate levels, 144-14'6, 
463-465 

for n-electron atom, 558-560 
Phase integral, 2, 122 


Phase shift, 110, 298-309, 631 
Born approximation for, 307 
integral formula for, 309 
in transmission through a barrier, 
110 

Phase space, in WKB 

approximation, 122 
Phonons, 3, 222 

Photoelectric effect, 501-502, 515 
Photoemission, 515-516 
Photon correlations, 586-589 
Photon field operator(s), 572 
Photons, 3, 222, 569 
absorption of, 492-493, 577-579 
detection of, 583 
emission of, 577-580 
orbital angular momentum of, 

575 

spin of, 569, 575-576 
Picture, of quantum dynamics, 
319-323, 

Heisenberg, 320 
interaction (Dirac), 323, 483 
Schrodinger, 316-320 
Planck’s constant, 1, 348, 641 
Planck’s black-body radiation 
formula, 590 
Plane wave, 13-14, 43 
expansion in spherical 
harmonics, 261 
p-Normalization, 63 
Poincare vector, 395 
Poisson bracket, 326 
Poisson distribution, 227, 341, 582 
Polarizability: 
of atom, 461 
of hydrogen atom, 462 
of isotropic oscillator, 461 
Polarization: 
of electron, 376-377 
of light, 576 

Polarization current density, 623 
Polarization vector, 376, 394 
and density matrix, 392-399, 
403-404 

equation of motion for, 396 
precession of, 396-397 
and scattering, 376-377, 399-403 
for statistical ensemble, 403 
Positive definite operator, 193 
Positive frequency part of field, 

572, 594 
Positron, 592 
vacuum, 614 
wave function, 614-615 
Positronium, decay of, 449 
Positrons, sea of, 618 
Potential: 

Coulomb, 265 
delta function, 107 
double oscillator, 149-150 
double well, 11 
exponential, 275, 
harmonic oscillator, 79 
hindered rotation, 158 
Kronig-Penney, 168 
linear, 123 
periodic, 165 
rectangular barrier, 97 
sectionally constant, 92 
spherically symmetric (central), 
256 

spherical square well, 262 
square well, 103 


Potential barrier, 97 
Potential energy surface, 163 
Potential step, 92 
Poynting vector, 494 
Principal quantum number, 267, 
311, 627 

Principle of complementarity, 4 
Principle of relativity, 75 
Principle of superposition, 12-14, 
57-58 

and time development, 316 
Probability: 

basic theory of, 634-638 
in coordinate and momentum 
space, 29-34 
conservation of, see 

Conservation of probability 
current density, 26-27 
in Dirac theory, 600, 610, 616, 
623 

as expectation value of 
operator, 49 

gauge invariant form of, 74 
represented by Wigner 
distribution, 49, 370 
density, 26-27, 29-30, 203 
in Dirac theory, 616 
as expectation value of 
operator, 49 

in momentum space, 32-34 
represented by Wigner 
distribution, 49 
interpretation, 7, 9, 25-29, 57 
sources and sinks of, 78 
in spin theory 380, 403 
Probability amplitude(s), 8, 59, 

179, 195 

closure relation for, 1 83 
composition rule for, 182 
interference of, 1 82 
as inner product, 195 
orthonormality of, 183 
reciprocal property of, 182 
time development of, 315 
Probability distribution, of radial 
coordinate in hydrogen(ic) 
atom, 274 

Projection operator, 189, 217, 364, 
393, 404 
rank of, 217 
Propagator, 349 

for free particle, 351 
for harmonic oscillator, 352 
Pseudoscalar operator in Dirac 
theory, 605 
Pure state, 366 

Quadratic integrability, 27 
Quadratic Stark effect, 460 
Quadrupole approximation, 516 
Quadrupole interaction, 450 
Quantization postulates, rules, 
323-326 

Quantum (quanta), 3, 222 
Quantum condition, 2, 122 
Quantum correlations, 228,262 
Quantum defect, 268 
Quantum chromodynamics (QCD), 
538 

Quantum electrodynamics (QED), 
538, 577 

Quantum field operator, 546 
Quantum field theory, 551 
Quantum fluctuations, 228 
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Quantum measurement theory, 
363-365, 370, 408 
Quantum number(s), 84, 473 
group theoretical meaning, 422 
Quantum of action, 1 
Quantum potential, 29, 354 
Quantum theory of radiation, 501 
Quarks, 536 

Juasiclassical states, 228, see also 
Coherent states 
Juasiparticle, 222 
Juasiparticle transformation, 231 

?(3), rotation group in three 
dimensions, 421 
Radial Dirac equation, 625 
Radial eigenfunction, 257 
boundary condition for, 263 
Radial Schrodinger equation, 257, 
263-265 

Radiation, see Absorption and 
Emission of radiation 
Radiation field, quantum theory of, 
569-576 

Radiationless transition(s), 
504-505, 507-508 
Raising operator, 221, See also 
Creation operator 
Randomness, 366-367 
Random variable, 638 
Rank: 

of group, 421 

of projection operator, 217, 364 
Rate of transition, 503-510, 
520-521 

Rayleigh-Ritz trial function, 
139-142 

Rayleigh-Schrodinger perturbation 
theory, 451-459 

and variational method, 473-476 
Reciprocal basis, 147 
Reciprocal lattice, 71, 167-168 
Reciprocal lattice vector, 314 
Reciprocity relation, 532 
Rectangular potential barrier, 97 
Rectangular well, see Square well 
Reduced matrix element, 435 
Reduction: 

of direct product representation, 
431-432, 557 

of group representation, 418 
of state by measurement, 408 
Reflection, 439 
of coordinates, 81 
and rotation, 440-441 
of incident wave, 96 
Reflection coefficient, 96 
Reflection operator, 440 
Regeneration, of amplitudes, 407 
Relative motion, 149, 274, 

359-360 

Relative probabilities, 28 
Relativistic invariance of Dirac 
equation, 600-606 
Relativistic Schrodinger equation 
for scalar particle, 621, 629 
Relativistic wave equation for 
electron, 621 

Repeated eigenvalues, 56, 207, 214 
Representation, of groups, 417-421 
in quantum mechanics, 191,199. 
See also Coordinate 
representation; Energy 
representation; Momentum 
representation 


of rotations, 417, 421-426 
in spin space, 382-385, 388 
of state, by entire functions, 228 
Repulsion of perturbed energy 
levels, 462 

Resolvent operator, 525 
Resonance, in spin precession, 397 
magnetic, 399 
Resonance(s), 110 
profile of, 514 
in scattering, 289, 304-308 
spacing of, 130 
in transmission, 109-111 
and wave packets, 130-133, 289, 
307 

width of, 130, 133, 304-306, 

514 

in WKB approximation, 130 
Riesz representation theorem, 188 
Rigid rotator, 480 
Rotation matrix, 383-384, 387, 
423-426 

symmetry relations for, 425, 443 
Rotation operator, 381-382, 413 
Rotations, 234-236, 381-385, 417 
Runge-Lenz vector, 268 
Russell-Saunders ( L-S ) coupling, 
559 

Rutherford scattering cross section, 
284, 297, 312-313 

Saturation of absorption line, 500 
Scalar operator, 236-237 
Scalar operator in Dirac theory, 

605 

Scalar product, see Inner product 
Scattering, 278-313 
in Coulomb field, 310-313 
of particles with spin, 399-403 
by square well, 108-11 
of wave packets, 286 
Scattering amplitude, 289, 295 
in Born approximation, 296 
partial wave expansion of, 301 
and scattering matrix, 531 
for spin one-half particle, 399 
and transition matrix, 524 
Scattering coefficient, 111, 533 
Scattering cross section, see Cross 
section 

Scattering equation, 525-527 
Scattering matrix, 400, 519, 527, 
see also S matrix 
invariance of, 400, 530-532 
one-dimensional analogue of, 
99-103 

Scattering operator, 340, 528 
relation to time development 
operator, 529 
unitarity of, 529 
Scattering phase shift, see Phase 
shift 

Schmidt orthogonalization method, 
55-56, 207 

Schmidt values for magnetic 
moment of nucleus, 449 
Schrodinger, Erwin, 5 
Schrodinger equation, 42 
time-dependent, 25 
for relative motion, 359-360 
for two particles, in configuration 
space, 359 

Schrodinger picture, 316-320, 617 
Schrodinger representation, 345 
Schrodinger’s cat, 362 


Schur’s lemma, 421 
Schwarz inequality, 193 
Screened Coulomb potential, 277, 
297 

Screening constant for helium 
atom, 478-479 

Second-order equation in Dirac 
theory, 621 

Second -order perturbation theory, 
time-dependent, 508-509 
Second quantization, 551 
Sectionally constant potential, 
92-112 

Secular equation, 140, 209, 464, 
473 

Selection rule, 90 
for CG coefficients, 428 
for electric dipole transition, 497 
for electric multipole moments, 
437, 441, 444 

for irreducible tensor operators, 
435-436 

relation to symmetry, 466 
Self-adjoint operator, 52, 192 
Self-consistent solution, 552, 563 
Self-reciprocal basis, 147 
Semiclassical approximation, 24, 
113 

Semiclassical state, 228, see also 
Coherent state 

Separable Hilbert space, 1 85 
Separable scattering potential, 534 
Separable two-particle wave 
function, 359, 361 
Separation of variables, 257, 270 
Shannon (mixing) entropy, 367, 
403, 636 

Shells, atomic, 559 
Similarity transformation, 200 
Simple eigenvalue, 56 
Simple scattering, 518 
Simultaneous measurements, 180, 
214-217 
Singlet state, 431 
Single-valued wave function, 45, 

4 243 

Slater determinant, 564 
S matrix, 100-103, 530-532 
eigenvalue of, 302, 532 
poles of, 105 
unitarity of, 529 

SO(n), special orthogonal matrices, 
n dimensions, 421 
S operator, see Scattering operator 
Space quantization, 373 
Spectral decomposition, 217 
Spectroscopic stability, principle 
of, 499 

Spectrum, 54, 181 
of Schrodinger equation, 44 
Spherical cylinder functions, 

Bessel, Hankel, Neumann 
functions, 259-260 
Spherical harmonics, 248-252 
and harmonic functions, 254 
in momentum space, 443 
reflection properties of, 249 
and rotation matrices, 425-426 
Spherical polar coordinates, 242 
Spin, 372, 390 
of photon, 575-576 
operators, 385-390 
and statistics, 543, 556 
quantum dynamics of, 390-392 
total, 430-431 
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Spin filter, 408 

Spin flip amplitude, 401 

Spin matrices, in Dirac theory, 603. 

See also Pauli spin matrices 
Spin one-half bosons, 556-558 
Spin-orbit interaction, 389, 399, 
416, 469-473, 480 
Spinors, 379,595 
Spin polarization, 392-399. See 
also Polarization 
Spinor wave function, 378, 614 
Spins, addition of, 430-431 
localized, 567 
Spin variable, 378 
Splitting of degenerate energy 

levels, 154-155, 178, 468, 
472-474, 480 

Spontaneous emission, 501, 
579-580 

Spontaneous symmetry breaking, 
151-152 

Spreading of wave packet, 20-21, 
24, 49, 333, 351 
Square well: 

in one dimension, 92-93, 103 
eigenvalues and eigenfunctions 
of, 103-108 

transmission through, 108-111 
in three dimensions, 262-263 
Squeezed states, 230-231, 343 
S state, 245, 265 
as ground state, 263 
Standing wave Green’s function, 
293, 524 

Stark effect, 460, 462 
linear, of hydrogen, 467-468 
State, 28, 185 

pure, mixed, and unpolarized, 
366, 399 

State vector, 185, 388 
and wave function, 203 
Stationary state, 41-47, 334-335 
Statistical ensemble, density matrix 
for, 366, 399 

Statistical thermodynamics, 
369-370, 564-567 
Statistics of particles, 554 
Step function, see Heaviside step 
function 

Stern-Gerlach experiment, 

373-374, 406-408 
Stieltjes integral, 181 
Stimulated emission, 493, 499, 578 
Stochastic process, 510 
Stokes parameters, 395, 404 
Sturm-Liouville equation, 59, 121, 
261 

SU( 2) group, 387 
SU(n), special unitary group, n 
dimensions, 421 
Sudden approximation, 342 
Sum rule: 

for electric dipole cross section, 
489-490 

generalization of, 516 
for oscillator strengths, 489 
Thomas-Reiche-Kuhn, 489, 516 
Superposition of states, see 

Principle of superposition 
Superposition of stationary states, 
44 

Superselection rule, 414, 612 
Symmetric top, 450 
Symmetry: 
chiral, 620, 622 


four dimensional rotation, 
269-270 

local gauge, 444-447 
reflection, 101 
rotational, 390 
of Schrodinger equation, 102 
of S matrix, 101, 105, 400 
time reversal, 100-101 
translational, 165 
Symmetry group, 417 
Symmetry operation, 411 

Tensor operator, 432-437 
Tensor operator in Dirac theory, 

605 

Tensor of polarizability, 461 
Tensor product, see Direct product 
Thermal equilibrium, 369, 564 
Thomas-Reiche-Kuhn sum rule, 

489, 516 

Thomas (precession) term, 470 
Tight-binding approximation, 167 
Time delay in scattering resonance, 
110,307 

Time-dependent perturbation 
theory, 485-487 
Time-dependent Schrodinger 

equation, 22, 25, 41, 44, 46 
Time development: 
of x, p, Ax, and Ap, 49, 

332-333, 351 

of density operator (matrix), 319, 
322, 369-370, 395 
of operators, 321, 332 
of physical system, 41-44, 
315-319 

of polarization vector, 396-398 
of spin state, 390-392 
of state vector, 317, 482 
Time development (evolution) 
operator, 41, 316, 484 
Time-independent Schrodinger 
equation, see Schrbdinger 
equation 

Time-independent wave function, 

42 

Time-ordered product, 338, 484 
Time reversal, 100, 441-444 
in Dirac theory, 611-612 
in scattering, 532 
Total angular momentum, see 
Angular momentum 
Transfer matrix, 169 
Transformation coefficients, 199, 
201, 205, 346, 538-539 
Transition amplitude, 316, 323, 484 
Transition current density, 26, 623 
Transition matrix (element), 
519-521 

Transition probability per unit 

time, see Rate of transition 
Translation operator, 69, 165, 
Transmission coefficient, 96, 109, 
126, 533 

Transmission through barrier 
(WKB), 125-133 
Transpose of an operator, 192 
Transposition of matrix, 595 
Triangular condition for adding 
angular momenta, 428 
Triplet state, 431 
Tunneling, 97-98, 125-133, 155, 
167 

Turing’s paradox, see Zeno’s 
paradox 


Two-component theory of 

relativistic spin one-half 
particle, 629 
Two-level system, 391 
Two-particle matrix element, 
545-546 

diagonal form, 551 
Two-particle operator, 545, 555 
Two-particle state, 555-556 
Two-particle system, relative 
motion of, 359-360 
Two-photon emission, 591 
Two-slit interference, 8-9, 12, 
183-185, 584-546 

Uhlenbeck, George E., 374 
Uncertainties, 218 
Uncertainty principle, 18 
Uncertainty relation, see 

Heisenberg uncertainty 
relation 

Unimodular matrix, 385, 387 
Unitary matrix, 100, 195, 382, 
Unitary operator, 68,. 194, 
eigenvalues of, 210 
Unitary symmetry, principle of, 539 
Unitary transformation, 201 
and states of identical particles, 
538-539 

Unitary unimodular group in two 
dimensions, SU( 2), 424 
Units, 640-641 
Unit vector, 187 
Universal covering group, 424 
Unstable particles, 44 

Vacuum expectation value, for 

electron-positron field, 596 
Vacuum state, 222, 537 
Variance, 16, 49, 634 
of observable, 218 
Variational method, 135-139, 
212-214, 474 
accuracy of, 481 
applied to helium atom, 478 
for n identical fermions, 560-562 
and perturbation theory, 473-476 
Variational trial function, 137-140, 
176-178, 276-277, 560 
Vector addition coefficients, see 

Clebsch-Gordan coefficients 
Vector model, in old quantum 
theory, 241, 438 
Vector operator, 236, 383, 388, 
433-434, 438 

commutation relations for, 236, 
384 

Wigner-Eckart theorem for, 438 
Vector operator in Dirac theory, 

605 

Vector potential as quantum field, 
572 

Velocity-dependent interaction, 335 
Velocity operator in Dirac theory, 
617 

Virial theorem, 47-48, 177, 476 
Virtual transition(s), 509 
von Neumann entropy, 368, 564 

Wave equation, 5, 25, 46, 347-348 
in momentum space, 46, 180, 

348 

Wave function, 5, 28, 180, 345 
complex valuedness of, 13 
in configuration space, 345, 547 
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Index 


fave function ( Continued ) 
meaning of, 4 

in momentum space, 47, 345 
for photon, 571 
quantization of, 551 
and Wigner distribution, 49-50 
/ave mechanics, 142, 205 
/ave packet, 14-18, 24 
collapse of, 408 
in oscillator potential, 89-90 
scattering of, 286-290 
splitting of, 96 

spreding of, 20-22, 24, 49, 333, 
351 

in WKB approximation, 130-133 


Wentzel, Gregor, 113 
Width of resonance, 130, 133, 306, 
514 

Wigner coefficients, see Clebsch- 
Gordan coefficients 
Wigner distribution, 49-50, 
370-371 

Wigner-Eckart theorem, 386, 435 
applications of, 437-439 
and time reversal, 444 
Winding number, 414 
WKB approximation, 113-134 
applied to radial equation, 
and bound states, 121 — 125 
connection formulas for, 1 16 


conservation of probability in, 
112 

and Coulomb potential 

(hydrogenic atom), 275 
and double well potential, 134 
and periodic potential, 178 
Wronskian, 45, 121, 259 

Yang-Mills field equations, 447 
Yukawa potential, 277, 297 

Zeeman effect, 473-474 
Zeno’s paradox, 514-515 
Zero point energy, 84. 232, 574 
Zitterbewegung, 619 



